CHLAMYDIA PROTEIN, GENE SEQ TTF.WGE AND USES THEREOF 

1. FIELD OF THE INVENTION 

The present invention generally relates to a high 
5 molecular weight ("HMW") protein of Chlamydia, the amino acid 
sequence thereof, and antibodies, including cytotoxic 
antibodies, that specifically bind the HMW protein. The 
invention further encompasses prophylactic and therapeutic 
compositions comprising the HMW protein, a fragment thereof, 
10 or an antibody that specifically binds the HMW protein or a 
portion thereof or the nucleotide sequence encoding the HMW 
protein or a fragment thereof, including vaccines. The 
invention additionally provides methods of preventing, 
treating or ameliorating disorders in mammals and birds 
15 related to Chlamydia infections and for inducing immune 
responses to Chlamydia. The invention further provides 
isolated nucleotide sequences and degenerate sequences 
encoding the HMW protein, vectors having said sequences, and 
host cells containing said vectors. Diagnostic methods and 
20 kits are also included. 

2. BACKGROUND OF THE INVENTION 

Chlamydia are prevalent human pathogens causing 
disorders such as sexually transmitted diseases, respiratory 
25 diseases including pneumonia, neonatal conjunctivitis, and 
blindness. Chlamydia are obligate intracellular bacteria 
that infect the epithelial lining of the lung, conjunctivae 
or genital tract. The most common species of Chlamydia 
include Chlamydia trachomatis. Chlamydia psittaci. Chlamydia 

30 pecorum and Chlamydia pneumoniae. Recently, the newly 

designated species of Chlamydia, C. pneumoniae (formerly C. 
trachomatis TWAR) , has been implicated as a major cause of 
epidemic human pneumonitis and perhaps may play a role in 
atherosclerosis . 

35 There are currently 18 recognized C. trachomatis 

serovars, causing trachoma and a broad spectrum of sexually 
transmitted diseases: with the A, B and C serovars being most 
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frequently associated with trachoma, while the D-K serovars 
are the most common cause of genital infections. 

C. trachomatis is the major cause of sexually 
transmitted disease in many industrialized countries, 
5 including the United States. While the exact incidence of C. 
trachomatis infection in the U.S. is not known, current 
epidemiological studies indicate that more than 4 million 
chlamydial infections occur each year, compared to an 
estimated 2 million gonococcal infections. While all racial, 

10 ethnic and socioeconomic groups are affected, the greatest 
prevalence of chlamydial infections occur among young, 12 to 
20 year-old, sexually active individuals. Most genitourinary 
chlamydial infections are clinically asymptomatic. Prolonged 
carriage in both men and women is common. As many as 25% of 

15 men and 75% of women diagnosed as having chlamydial 
infections have no overt signs of infection. As a 
consequence, these asymptomatic individuals constitute a 
large reservoir that can sustain transmission of the agent 

within the community. 
20 Far from being benign, serious disease can develop 

from these infections including: urethritis, lymphogranuloma 
venereum (LGV) , cervicitis, and epididymitis in males. 
Ascending infections from the endocervix commonly gives rise 
to endometritis, pelvic inflammatory disease (PID) and 
. 25 salpingitis which can cause tubal occlusion and lead 
ultimately to infertility. 

C. trachomatis infection of neonates results from 
perinatal exposure to the mother's infected cervix. Nearly 
70% of neonates born vaginally to mothers with chlamydial 
30 cervicitis become infected during delivery. The mucus 

membranes of the eye, oropharynx, urogenital tract and rectum 
are the primary sites of infection. Chlamydial 
conjunctivitis has become the most common form of ophthalmia 
neonatorum. Approximately 20-30% of exposed infants develop 
35 inclusion conjunctivitis within 14 days of delivery even 
after receiving prophylaxis with either silver nitrate or 
antibiotic ointment. C. trachomatis is also the leading cause 
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of infant pneumonia in the United States. Nearly 10-20% of 
neonates delivered through an infected cervix will develop 
chlamydial pneumonia and require some type of medical 
intervention. 

5 in developing countries, ocular infections of 

C. trachomatis cause trachoma, a chronic follicular 
conjunctivitis where repeated scar formation leads to 
distortion of the eyelids and eventual loss of sight. 
Trachoma is the world's leading cause of preventable 
10 blindness. The World Health Organization estimates that over 
500 million people worldwide, including about 150 million 
children, currently suffer from active trachoma and over 6 
million people have been blinded by this disease. 

in industrialized countries, the costs associated 
15 with treating chlamydial infections are enormous. In the 
U.S., the annual cost of treating these diseases was 
estimated at $2.5-3 billion in 1992 and has been projected to 
exceed $8 billion by the year 2000. 

one potential solution to this health crisis would 
20 be an effective chlamydial vaccine. Several lines of 
evidence suggest that developing an effective vaccine is 

feasible. ^ *.v. *- 

Studies in both humans and primates have shown that 

short-term protective immunity to C. trachoTnatis can be 

25 produced by vaccinating with whole Chlamydia. However, 

protection was characterized as short lived, serovar 

specific, and due to mucosal antibody. Additionally, in some 

vaccinees disease was exacerbated when these individuals 

became naturally infected with a serovar different from that 

30 used for immunization. This adverse reaction was ultimately 

demonstrated to be due to a delayed-type hypersensitivity 

response. Thus, the need exists to develop a subunit-based 

chlamydial vaccine capable of producing an efficacious but 

nonsensitizing immune response. Such a subunit vaccine may 

35 need to elicit both mucosal neutralizing secretory IgA 

antibody and/or cellular immune response to be efficacious. 
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Subunit vaccine development efforts to date have 
focused almost exclusively on the major outer membrane 
protein (MOMP) MOMP is an integral membrane protein of 
approximately 40 kDa in size and comprises up to about 60% of 
the infectious elementary body (EB) membrane protein 
(Caldwell, H.D., J.Kromhout, and L.Schachter. 1981. Infect^ 
Itamun .. 31:1161-1176) . MOMP imparts structural integrity to 
the extracellular EB and is thought to function as a porin- 
like molecule when the organism is growing intracellular ly 
and is metabolically active. With the exception of four 
surface exposed variable domains (VDI-VDIV) , MOMP is highly 
conserved among all 18 serovars. MOMP. is highly immunogenic 
and can elicit a local neutralizing anti-Chlamydia antibody. 
However, problems exists with this approach. 

TO date, most MOMP-specif ic neutralizing epitopes 
that have been mapped are located within the VD regions and 
thus give rise only to serovar-specif ic antibody. Attempts 
to combine serovar-specif ic epitopes in various vaccine 
vectors (e.g. poliovirus) to generate broadly cross-reactive 
neutralizing antibodies have been only marginally successful 
(Murdin, A.D., H. su, D.S. Manning, M.H, Klein, M.J, Parnell, 
and H.D. Caldwell. 1993. Tnfeot. Immun. , 61:4406-4414; 
Murdin, A.D., H. Su, M.H. Klein, and H.D. Caldwell. 1995. 
Tnfect. Immun. . 63:1116-1121). 
5 Two other major outer membrane proteins in C. 

trachomatis, the 60 kDa and 12 kDa cysteine-rich proteins, as 
well as the surface-exposed lipopolysaccharide, are highly 
immunogenic but, unlike MOMP, have not been shown to induce a 
neutralizing antibody (Cerrone et al., 1991, Tnfect. Immun. , 
0 59:79-90). Therefore, there remains a need for a novel 
subunit-based chlamydial vaccine. 

3. SUMMARY OF THE IN VENTION 

An Object of the present invention is to provide an 
5 isolated and substantially purified high molecular weight 
protein of a Chlamydia sp. ("HMW protein"), wherein the HMW 
protein has an apparent molecular weight of about 105-115 
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kDa, as determined by SDS-PAGE, or a fragment or analogue 
thereof. Preferably the HMW protein has substantially the 
amino acid sequence of any of SEQ ID Nos.: 2, 15 and 16. 
Preferred fragments of the HMW protein include SEQ ID Nos: 3, 
5 17, and 25-37. AS used herein, "substantially the sequence- 
is intended to mean that the sequence is at least 80%, more 
preferably at least 90% and most preferably at least 95% 
identical to the referenced sequence. Preferably, the HMW 
protein is an outer membrane protein. More preferably, the 
10 outer membrane HMW protein is surface localized. Preferably, 
the HMW protein has a heparin binding domain. Preferably, 
the HMW Protein has a porin-like domain. It is intended that . 
all species of Chlamydia are included in this invention, 
however preferred species include Chlamydia trachomatis, 
15 Chlamydia psittaci, Chlamydia percornm and Chlamydia 

pneumoniae. The substantially' purified HMW protein is at 
least 70 wt% pure, preferably at least about 90 wt% pure, and 
may be in the form of an aqueous solution thereof. 

Also included in this invention are recombinant 
20 forms of the HMW protein, wherein in transformed E. coli 

cells, the. expressed recombinant form of the HMW protein has 
an apparent molecular weight of about 105-115 kDa, as 
determined by SDS-PAGE, or a fragment or analogue thereof. 
The term HMW-derived polypeptide is intended to include 
25 fragments of the HMW protein; variants of wild-type HMW 
protein or fragment thereof, containing one or more amino 
acid deletions, insertions or substitutions; an4 chimeric 
proteins comprising a heterologous polypeptide fused to the 
C-terminal or N-terminal or internal segment of a whole or a 
30 portion of the HMW protein. 

As used herein and in the claims, the term "HMW 
protein" refers to a native purified or recombinant purified 
high molecular weight protein of a species of Chlamydia 
wherein the apparent molecular weight (as determined by SDS- 
35 PAGE) is about 105-115 kDa. AS used herein and in the 
claims, the term " rHMW protein" refers to recombinant HMW 
protein. 
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Another object of the present invention is to 
provide an isolated substantially pure nucleic acid molecule 
encoding a HMW protein or a fragment or an analogue thereof. 
Preferred is the nucleic acid sequence wherein the encoded 

5 HMW protein comprises the amino acid sequence of any of SEQ 
ID Nos.: 2, 15 and 16, or a fragment thereof, particularly 
SEQ ID NOS.: 3, 17, 25-37. Also included is an isolated 
nucleic acid molecule comprising a DNA sequence of any of SEQ 
ID Nos.: 1, 23-24 or a complementary sequence thereof; a 

10 fragment of the HMW DNA sequence having the nucleic acid 
sequence of any of SEQ ID Nos.: 4-14, 18-22 or the 
complimentary sequence thereto; and a nucleic acid sequence 
which hybridizes under stringent conditions to any one of the 
sequences described above. The nucleic acid that hybridizes 

15 under stringent condition preferably has a sequence identity 
of about 70 % with any of the sequences identified above, 

more preferably about 90 %. 

The production and use of derivatives and analogues 
of the HMW protein are within the scope of the present 
20 invention. In a specific embodiment, the derivative or 

analogue is functionally active, i.e., capable of exhibiting 
one or more functional activities associated with a full- 
length, wild-type HMW protein. As one example, such 
derivatives or analogues which have the desired 
. 25 immunogenicity or antigenicity can be used, for example, in 
immunoassays, for immunization, etc. A specific embodiment 
relates to a HMW fragment that can be bound by an anti-HMW 
antibody. Derivatives or analogues of HMW can be tested for 
the desired activity by procedures known in the art. 
30 In particular, HMW derivatives can be made by 

altering HMW sequences by substitutions, additions or 
deletions that provide for functionally equivalent molecules. 
Due to the degeneracy of nucleotide coding sequences, other 
DNA sequences which encode substantially the same amino acid 
35 sequence as a HMW gene may be used in the practice of the 
present invention. These include but are not limited to 
nucleotide sequences comprising all or portions of genes 



_ g _ PENY3-594263.1 



which are altered by the substitution of different codons 
that encode a functionally equivalent amino acid residue 
within the sequence, thus producing a silent change. 
Likewise, the HMW derivatives of the invention include, but 
5 are not limited to, those containing, as a primary amino acid 
sequence, all or part of the amino acid sequence of a HMW 
protein including altered sequences in which functionally 
equivalent amino acid residues are substituted for residues 
within the sequence resulting in a silent change. For 

10 example, one or more amino acid residues within the sequence 
can be substituted by another amino acid of a similar 
polarity which acts as a functional equivalent, resulting in 
a silent alteration. Substitutes for an amino acid within 
the sequence may be selected from other members of the class 

15 to which the amino acid belongs. For example, the nonpolar 
(hydrophobic) amino acids include alanine, leucine, 
isoleucine, valine, proline, phenylalanine, tryptophan and 
methionine. The polar neutral amino acids include glycine, 
serine, threonine, cysteine, tyrosine, asparagine, and 

20 glutamine. The positively charged (basic) amino acids 
include arginine, lysine and higtidine. The negatively 
charged (acidic) amino acids include aspartic acid and 

glutamic acid. 

In a specific embodiment of the invention, proteins 

25 consisting of or comprising a fragment of a HMW protein 

consisting of at least 6 (continuous) amino acids of the HMW 
protein is provided. In other embodiments, the fragment 
consists of at least 7 to 50 amino acids of the HMW protein, 
in specific embodiments, such fragments are not larger than 

30 35, 100 or 200 amino acids. Derivatives or analogues of HMW 
include but are not limited to those molecules comprising 
regions that are substantially homologous to HMW or fragments 
thereof {e.g., in various embodiments, at least 60% or 70% or 
80% or 90% or 95% identity over an amino acid sequence of 

35 identical size or when compared to an aligned sequence in 
which the alignment is done by a computer homology program 
known in the art) or whose encoding nucleic acid is capable 
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of hybridizing to a coding HMW sequence, under stringent, 
moderately stringent, or nonstringent conditions. 

The HMW derivatives and analogues of the invention 
can be produced by various methods known in the art. The 
5 manipulations which result in their production can occur at 
the gene or protein level. For example, the cloned HMW gene 
sequence can be modified by any of numerous strategies known 
in the art (Sambrook et al . , 1989, Molecular Cloning, A 
Laboratory Manual, 2d. Ed., Cold Spring Harbor Laboratory 
10 press, cold spring Harbor, New York). The sequence can be 
cleaved at appropriate sites with restriction 
endonuclease(s), followed by further enzymatic modification 
if desired, isolated, and ligated in vitro. In the 
production of the gene encoding a derivative or analogue of 
15 HMW, care should be taken to ensure that the modified gene 
remains within the same translational reading frame as HMW, 
uninterrupted by translational stop signals, in the gene 
region where the desired HMW activity is encoded. 

Additionally, the HMW-encoding nucleic acid 
20 sequence can be mutated in vitro or in vivo, to create and/or 
destroy translation, initiation, and/or termination 
sequences, or to create variations in coding regions and/or 
form new restriction endonuclease sites or destroy 
preexisting ones, to facilitate further in vitro 
25 modification. Any technique for mutagenesis known in the art 
can be used, including but not limited to, chemical 
mutagenesis, in vitro site-directed mutagenesis (Hutchinson, 
C, et al., 1978, J. Biol. Chem 253:6551), use of TAB® 

linkers (Pharmacia) , etc. 

30 Manipulations of the HMW sequence may also be made 

at the protein level. Included within the scope of the 
invention are HMW protein fragments or other derivatives or 
analogues which are differentially modified during or after 
translation, e.g., by glycosylation, lipidation, acetylation, 

35 phosphorylation, amidation, derivatization by known 

protecting/blocking groups, proteolytic cleavage, linkage to 
an antibody molecule or other cellular ligand, etc. Any of 
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numerous chemical modifications may be carried out by known 
techniques, including but not limited to specific chemical 
cleavage by cyanogen bromide, trypsin, chymotrypsin, papain, 
V8 protease, NaBH4; acetylation, formylation, oxidation, 
5 reduction; metabolic synthesis in the presence of 

tunicamycin; etc. 

In addition, analogues and derivatives of HMW can 
be chemically synthesized. For example, a peptide 
corresponding to a portion of a HMW protein which comprises 
10 the desired domain, or which mediates the desired activity in 
vitro, can be synthesized by use of a peptide synthesizer. 
Furthermore, if desired, nonclassical amino acids or chemical 
amino acid analogues can be introduced as a substitution or 
addition into the HMW sequence. Non-classical amino acids 
15 include but are not limited to the D-isomers of the common 
amino acids, a-amino isobutyric acid, 4-aminobutyric acid, 
Abu, 2-amino butyric acid, 7-Abu, e-Ahx, 6-amino hexanoic 
acid, Aib, 2-amino isobutyric acid, 3-amino propionic acid, 
ornithine, norleucine, norvaline, hydroxy pro line, sarcosine, 
20 citrulline, cysteic acid, t-butylglycine, t-butylalanine, 
phenylglycine, cyclohexylalanine, jS-alanine, fluoro-amino 
acids, designer amino acids such as ^-methyl amino acids, Ca- 
methyl amino acids, Na -methyl amino acids, and amino acid 
analogues in general. Furthermore, the amino acid can be D 
25 (dextrorotary) or L (levorotary) . 

Another object of the invention is to provide a 
recombinant expression vector adapted for transformation of a 
host or for delivery of a HMW protein to a host comprising 
the nucleic acid molecule of SEQ ID No.: 1, 23 or 24 or any 
30 fragment thereof. Preferably, the recombinant expression 
vector is adapted for transformation of a host and comprises 
an expression means operatively coupled to the nucleic acid 
molecule for expression by the host of said HMW protein or 
the fragment or analogue thereof. More preferred is the 
35 expression vector wherein the expression means includes a 

nucleic acid portion encoding a leader sequence for secretion 
from the host or an affinity domain coupled to either the N- 
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or C-terminus of the protein or the fragment or analogue 
thereof . 

A further aspect of the invention includes a 
transformed host cell containing an expression vector 
5 described above and the recombinant HMW protein or fragment 
or analogue thereof producible by the transformed host cell. 

Still a further aspect of the invention is directed 
to a HMW protein recognizable by an antibody preparation that 
specifically binds to a peptide having the amino acid 
10 sequence of SEQ ID No. 2, 15-16 or a fragment or 
conservatively substituted analogue thereof. 

Antigenic and/ or immunogenic compositions are 
another aspect of the invention wherein the compositions 
comprise at least one component selected from the following 

15 group: . . 

a) a HMW protein, wherein the molecular weight is 
about 105-115 kDa, as determined by SDS-PAGE, 
or a fragment or analogue thereof ; 

b) an isolated nucleic acid molecule encoding a 
20 HMW protein, or a fragment or analogue 

. thereof; 

c) an isolated nucleic acid molecule having the 
sequence of SEQ ID Nos. 1, 22, 23 or 24 , the 
complimentary sequence thereto or a nucleic 

25 acid sequence which hybridizes under stringent 

conditions thereto or fragment thereof; 

d) an isolated recombinant HMW protein, or 
fragment or analogue thereof, producible in a 
transformed host comprising an expression 

30 vector comprising a nucleic acid molecule as 

defined in b) or c) and expression means 
operatively coupled to the nucleic acid 
molecule for expression by the host of said 
HMW protein or the fragment or analogue 

35 thereof; 



- 10 - 



PENY3-594263.1 



( 



e) a recombinant vector comprising a nucleic acid 
encoding a HMW protein or fragment or analogue 
thereof ; 

f) a transformed cell comprising the vector of e) 
5 and optionally an adjuvant, and a 

pharmaceutical ly acceptable carrier or diluent 
therefor, said composition producing an immune 
response when administered to a host. 
Preferred adjuvants include cholera holotoxin or subunits, E, 
10 coli heat labile holotoxin, subunits and mutant forms 

thereof, alum, QS21, and MPL. Particularly, preferred are 
alum, LTR192G, mLT and QS21. 

Also included are methods for producing an immune 
response in a mammal or a bird comprising administering to 
15 said mammal, an effective amount of the antigenic or the 
immunogenic composition described above. 

Another aspect of the invention is directed to 
antisera raised against the antigenic or immunogenic 
composition of the invention, and antibodies present in the 
20 antisera that specifically bind a HMW protein or a fragment 
or analogue thereof. Preferably the antibodies bind a HMW 
protein having the amino acid sequence of SEQ ID Nos.: 2, IS- 
IS or fragment or a conservatively substituted analogue 
thereof. Also included are monoclonal antibodies that 
25 specifically bind a HMW protein or a fragment or analogue 
thereof . 

A further aspect of the invention includes 
pharmaceutical and vaccine compositions comprising an 
effective amount of at least one component selected from the 

30 following group: 

a) a HMW protein, wherein the isolated protein 

molecular weight is about 105-115 kDa, as 

determined by SDS-PAGE, or a fragment or 

analogue thereof; 
35 b) an isolated nucleic acid molecule encoding a 

HMW protein, or a fragment or analogue 

thereof ; 
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c) an isolated nucleic acid molecule having the 
sequence of SEQ ID Nos.: 1, 22, 23 or 24 the 
complimentary sequence thereto or a nucleic 
acid sequence which hybridizes under stringent 
conditions thereto or a fragment thereof; 

d) an isolated recombinant HMW protein, or 
fragment or analogue thereof producible in a 
transformed host comprising an expression 
vector comprising a nucleic acid molecule as 
defined in b) or c) and expression means 
operatively coupled to the nucleic acid 
molecule for expression by the host of said 
HMW protein of a Chlamydia species or the 
fragment or analogue thereof; 

e) a recombinant vector, comprising a nucleic 
acid encoding a HMW protein or fragment or 
analogue thereof; 

f) a transformed cell comprising the vector of 

e) , 

g) antibodies that specifically bind the 
component of a) , b) , c) , d) or e) , and 

a pharmaceutically acceptable carrier or diluent therefor. 
Preferred are vaccine compositions which are effective at the 

mucosal level. 

The invention also includes a diagnostic reagent 
which may include any one or more of the above mentioned 
aspects, such as the native HMW protein, the recombinant HMW 
protein, the nucleic acid molecule, the immunogenic 
composition, the antigenic composition, the antisera, the 
antibodies, the vector comprising the nucleic acid, and the 
transformed cell comprising the vector. 

Methods and diagnostic kits for detecting Chlamydia 
or anti-Chlamydia antibodies in a test sample are also 
included, wherein the methods comprise the steps of: 

a) contacting said sample with an antigenic 

composition comprising Chlamydia HMW protein 
or a fragment or analogue thereof or 
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immunogenic composition or antibodies thereto 
to form Chlamydia antigen: anti-Chlamydia 
antibody immunocomplexes, and further, 
b) detecting the presence of or measuring the 
5 amount of said immunocomplexes formed during 

step a) as an indication of the presence of 
said Chlamydia or anti-Chlamydia antibodies in 
the test sample. 
The diagnostic kits for detecting Chlamydia or antibodies 
10 thereto comprise antibodies, or an antigenic or immunogenic 
composition comprising Chlamydia HMW protein or a fragment or 
analogue thereof, a container means for contacting said 
antibodies or composition with a test sample suspected of 
having ant i-Chlamydia antibodies or Chlamydia and reagent 
15 means for detecting or measuring Chlamydia antigen: anti- 
Chlamydia antibody immunocomplexes formed between said 
antigenic or immunogenic composition or said antibodies and 

said test sample. 

A further aspect of the present invention provides 
2 0 methods for determining the presence of nucleic acids 

encoding a HMW protein or a fragment or analogue thereof in a 
test sample, comprising the steps of: 

a) contacting the test sample with the nucleic 
acid molecule provided herein to produce 

2 5 duplexes comprising the nucleic acid molecule 

and any said nucleic acid molecule encoding 
the HMW protein in the test sample and 
specifically hybridizable therewith; and 

b) determining the production of duplexes. 

30 The present invention also provides a diagnostic 

kit and reagents therefor, for determining the presence of 
nucleic acid encoding a HMW protein or fragment or analogue 
thereof in a sample, comprising: 

a) the nucleic acid molecule as provided herein; 
35 b) means for contacting the nucleic acid with the 

test sample to produce duplexes comprising the 
nucleic acid molecule and any said nucleic 
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acid molecule encoding the HMW protein in the 
test sample and specifically hybridizable 
therewith; and 
c) means for determining the production of 
5 duplexes. 

Also included in this invention are methods of 
preventing, treating or ameliorating disorders related to 
Chlamydia in an animal including mammals and birds in need of 
such treatment comprising administering an effective amount 
10 of the pharmaceutical or vaccine composition of the 

invention. Preferred disorders include. a Chlamydia bacterial 
infection, trachoma, conjunctivitis, urethritis, 
lymphogranuloma venereum (LGV) , cervicitis, epididymitis, or 
endometritis, pelvic inflammatory disease (PID) , salpingitis, 
15 tubal occlusion, infertility, cervical cancer, and 

artherosCleirosis. Preferred vaccine or pharmaceutical 
compositions include those formulated for in vivo 
administration to a host to confer protection against disease 
or treatment therefor caused by a species of Chlamydia. Also 
20 preferred are compositions formulated as a microparticle, 
capsule, liposome preparation or emulsion. 



25 



30 



35 



4. ABBREVIATIONS 
ant i -HMW 

ATCC 
immuno-reactive 

kDa 
OG 
OMP 
OMPs 
PBS 
PAGE 



HMW polypeptide antibody or 
antiserum 

American Type Culture Collection 

capable of provoking a cellular or 
humoral immune response 

kilodaltons 

n-octyl /3-D-glucopyranoside or octyl 
glucoside 

outer membrane protein 
outer membrane proteins 
phosphate buffered saline 
polyacrylamide gel electrophoresis 
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Dolvpeptide = a peptide of any length, preferably 
poiypepT:iae having ten or more amino acid 

residues 

SDS = sodium dodecylsulfate 

SDS-PAGE = sodium dodecylsulfate polyacrylamide 

gel electrophoresis 

Nucleotide or nucleic acid sequences defined herein 
are represented by one-letter symbols for the bases as 
follows: 
A (adenine) 
C (cytosine) 
G (guanine) 
T (thymine) 
U (uracil) 
M (A or C) 
R (A or G) 
W (A or T/U) 
S (C or G) 

Y (C or T/U) 
20 K (G or T/U) 

V (A or C or G; not T/U) 
H (A or C or T/U; not G) 
D (A or G or T/U; not C) 
B (C or G or T/U; not A) 

25 N (A or C or G or T/U) or (unknown) 

Peptide and polypeptide sequences defined herein 
are represented by one- letter symbols for amino acid residues 
as follows: 
A (alanine) 
R (arginine) 
N (asparagine) 
D (aspartic acid) 
C (cysteine) 
Q (glutamine) 
E (glutamic acid) 
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G (glycine) 
H (histidine) 
I (isoleucine) 
L (leucine) 
5 K (lysine) 
M (methionine) 
F (phenylalanine) 
p (proline) 
S (serine) 
10 T (threonine) 
W (tryptophan). 

Y (tyrosine) 

V (valine) 
X (unknown) 

15 



The present invention may be more fully understood 
by reference to the following detailed description of the 
invention, non-limiting examples of specific embodiments of 
the invention and the appended figures. 

20 

5. BRIEF DESCRIPTION OF THE FIGURES 

Figure 1: Western blot analysis of C. trachomatis 

elementary bodies (EBs) . 

25 Gradient purified EBs were solubilized in 

standard Laemmli SDS-PAGE sample buffer 
containing 2-mercaptoethanol, boiled for "3 
minutes and loaded onto a 4-12% Tris-glycine 
gradient gel containing SDS and 

30 electrophoresed at lOOV. Immediately 

following electrophoresis, proteins were 
electroblotted onto PVDF membranes at 4<>C for 
"^2.5 hours at "SOV. The blocked membrane was 
probed using a 1/5,000 dilution of anti-rHMWP* 

35 antibody (K196) for 1.5 hours at room 

temperature. Following washing, the membrane 
was treated with a 1/5,000 dilution of a goat 
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Figure 2. 
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Figure 3. 



Figure 4. 
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25 



30 



Figure 5. 
Figur 6 . 



35 



anti-rabbit IgG antibody conjugated to HRP for 
1 hour at room temperature. The blot was 
developed using a standard TMB substrate 
system. 

Three immunoreactive bands detected in EBs and 
RES. Dot indicates HMW Protein of about 105- 
115 kDa. 

consensus Nucleic Acid Sequence encoding the 
open reading frame of the HMW protein from C. 
trachomatis LGV Lj. 

Deduced Amino Acid Sequence of the HMW protein 
from the PGR open reading frame from C. 
trachomatis LGV Lj. 

SDS-PAGE of partially purified recombinant HMW 
protein from C. trachomatis LGVL^ expressed in 
E. coli. Counter stained and prestained SDS- 
PAGE standards were used as molecular weight 
markers. The positions of the molecular 
weight markers in the gel are noted on the 
left and right side of the figure by lines to 
the molecular weights (kDa) of some of the 
markers. See Text Example 10 for details. 
Lane A ; Mark 12 Wide Range Molecular Weight 
Markers (Novex) ; myosin, 200 Kdal; B- 
galactosidase, 116.3 Kdal; phosphorylase B, 
97.4 Kdal; bovine serum albumin, 66.3 Kdal. 
Lane B ; C. trachomatis L2 recombinant HMWP. 
Lane C : SeeBlue Prestained Molecular Weight 
markers (Novex); myosin, 250 Kdal; bovine 
serum albumin, 98 Kdal; glutamic 
dehydrogenase, 64 Kdal. 

Map of plasmids pAH306, pAH310, pAH312, pAH316 

and the PGR open reading frame. 

Predicted amino acid sequences, of HMW Protein 

for C. trachomatis L2, B, and F. 

The C. trachomatis L2 sequence is given in the 

top line and begins with the first residue of 
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the mature protein, E. Potential eucaryotic 
N-glycosylation sequences are underlined. A 
hydrophobic helical region flanked by proline- 
rich segments and of suitable length to span 
the lipid bilayer is underlined and enclosed 
in brackets. Amino acid differences 
identified in the B and F serovars are 
designated below the HMWP protein sequence. 
Figure 7 indirect florescence antibody staining of C. 

trachomatis Nil (serovar F) inclusion bodies 
using anti-rHMWP' antibody. 

Panel A: Post-immunization sera from rabbit 
K196. Chlamydia inclusion bodies are stained 
yellow. 

Panel B: Pre-immunization sera from rabbit 
K196. 

6. DETAILED DESCRIPTION OF TH K INVENTION 

The term "antigens" and its related term 
"antigenic" as used herein and in the claims refers to a 
substance that binds specifically to. an antibody or T-cell 
receptor. Preferably said antigens are immunogenic. 

The term "immunogenic" as used herein and in the 
claims refers to the ability to induce an immune response, 
e.g., an antibody and/or a cellular immune response in a an 
animal, preferably a mammal or a bird. 

The term "host" as used herein and in the claims 
refers to either in vivo in an animal or in vitro in 
mammalian cell cultures. 

An effective amount of the antigenic, immunogenic, 
pharmaceutical, including, but not limited to vaccine, 
composition of the invention should be administered, in which 
"effective amount" is defined as an amount that is sufficient 
to produce a desired prophylactic, therapeutic or 
ameliorative response in a subject, including but not limited 
to an immune response. The amount needed will vary depending 
upon the immunogenicity of the HMW protein, fragment, nucleic 
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acid or derivative used, and the species and weight of the 
subject to be administered, but may be ascertained using 
standard techniques. The composition elicits an immune 
response in a subject which produces antibodies, including 
5 anti-HMW protein antibodies and antibodies that are 

opsonizing or bactericidal. In preferred, non-limiting, 
embodiments of the invention, an effective amount of a 
composition of the invention produces an elevation of 
antibody titer to at least three times the antibody titer 
10 prior to administration. In a preferred, specific, non- 
limiting embodiment of the invention, approximately 0.01 to 
2000 ^lg and preferably 0.1 to 500 /xg are administered to a 
host. Preferred are compositions additionally comprising an 
adjuvant. 

15 Immunogenic, antigenic, pharmaceutical and vaccine 

compositions may be prepared as injectables, as liquid 
solutions or emulsions. The HMW protein may be mixed with 
one or more pharmaceutically* acceptable excipient which is 
compatible with the HMW protein. Such excipients may 

20 include, water, saline, dextrose, glycerol, ethanol, and 
combinations thereof. 

Immunogenic, antigenic, pharmaceutical and vaccine 
compositions may further contain one or more auxiliary 
substance, such as wetting or emulsifying agents, pH 

25 buffering agents, or adjuvants to enhance the effectiveness 
thereof. Immunogenic, antigenic, pharmaceutical and vaccine 
compositions may be administered pareriterally , by injection, 
subcutaneous ly or intramuscularly. 

Alternatively, the immunogenic, antigenic, 

30 pharmaceutical and vaccine compositions formed according to 
the present invention, may be formulated and delivered in a 
manner to evoke an immune response at mucosal surfaces. 
Thus, the immunogenic, antigenic, pharmaceutical and vaccine 
' compositions may be administered to mucosal surfaces by, for 
35 example, the nasal, oral (intragastric), ocular, branchiolar, 
intravaginal or intrarectal routes. Alternatively, other 
modes of administration including suppositories and oral 
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formulations may be desirable. For suppositories, binders 
and carriers may include, for example, polyalkalene glycols 
or triglycerides. Oral formulations may include normally 
employed incipients such as, for example, pharmaceutical 
5 grades of saccharine, cellulose and magnesium carbonate. 
These compositions can take the form of solutions, 
suspensions, tablets, pills, capsules, sustained release 
formulations or powders and contain about 0.001 to 95% of the 
HMW protein. The immunogenic, antigenic, pharmaceutical and 
10 vaccine compositions are administered in a manner compatible 
with the dosage formulation, and in such amount as will be 
therapeutically effective, protective or immunogenic. 

Further, the immunogenic, antigenic, pharmaceutical 
and vaccine compositions may be used in combination with or 
15 conjugated to one or more targeting molecules for delivery to 
specific cells of the immune system, such as the mucosal 
surface. Some examples include but are not limited to 
vitamin B12, bacterial toxins or fragments thereof, y 
monoclonal antibodies and other specific targeting lipids, i. 
20 proteins, nucleic acids or carbohydrates. 

The quantity to be administered depends on the 
subject to be treated, including, for example, the capacity 
of the individual's immune system to synthesize antibodies, 
and if needed, to produce a cell-mediated immune response. 
25 Precise amounts of active ingredient required to be 

administered depend on the judgment of the practitioner. 
However, suitable dosage ranges are readily determinable by 
one skilled in the art and may be of the order of 0.1 to 1000 
micrograms of the HMW protein, fragment or analogue thereof. 
30 Suitable regimes for initial administration and booster doses 
are also variable, but may include an initial administration 
followed by subsequent administrations. The dose may also 
depend on the route (s) of administration and will vary \ 
according to the size of the host. * 
35 The concentration of the HMW protein in an :<i 

antigenic, immunogenic or pharmaceutical composition 
according to the invention is in general about 0.001 to 95%. 
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A vaccine which contains antigenic raaterial o£ only one 
pathogen is a monovalent vaccine. Vaccines which contain 
fntigenlc material o. several pathogens are comhinea vacc.nes 
and !lsc belong to the present invention. Such combined 
5 vaccines contain, for example, material from various 

pathogens or from various strains of the same pathogen, or 
from combinations of various pathogens. 

The antigenic, immunogenic or pharmaceutical 
preparations, including vaccines, may comprise as the 
LLstimulating material a nucleotide vector 
least a portion of the gene encoding the HHW protein or the 
at least a portion of the gene may be used directly for 

immunization . 

TO efficiently induce humoral immune responses 

15 (HIR) and cell-mediated immunity (CMI) , immunogens are 

typically emulsified in adjuvants, ^^^'^''^^^^'^^iltll. 
significantly improved if the immunogen is co-administered 
with an adjuvant. Adjuvants may act by retaining t^e 
immunogen locally near the site of administration to produce 
20 I depot effect facilitating a slow, sustained release of 

antigen to cells 6f the immune system. Adjuvants can also ^ 
!«ract cells of the immune system to an immunogen depot and 
stimulate such cells to elicit immune responses. 

Many adjuvants are toxic, inducing granulomas, 

• ■ .i.^^^Hnns (Freund's complete adjuvant, 
25 acute and chronic inflammations (rreuno 

FCAl cvtolysis (saponins and Pluronic polymers) and 

ULity, arthritis and anterior uveitis (.PS and ^P, . 
Although FCA is an excellent adjuvant and widely used in 
Research, it is not licensed for use in human or veterinary 
30 vaccines because of its toxicity. 

Desirable characteristics of ideal adjuvants 

include: 

(1) lack of toxicity; 

(2) ability to stimulate a long-lasting immune 
35 response; ^^^^^^^^^^ „3„„,acture and stability in 

long-term storage; 
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(4) ability to elicit either CMI or HIR or both to 
antigens administered by various routes, if required; 

(5) synergy with other adjuvants; 

(6) capability of selectively interacting with 
5 populations of antigen presenting cells (APC) ; 

(7) ability to specifically elicit appropriate ThI 
or Th2 cell-specific ixnmune responses; and 

(8) ability to selectively increase appropriate 
antibody isotype levels (for example, IgA) against antigens. 

10 immunostimulatory agents or adjuvants have been 

used for many years to improve the host immune responses to, 
for example, vaccines. Intrinsic adjuvants, such as 
lipopoly saccharides, normally are the components of the 
killed or attenuated bacteria used as vaccines. Extrinsic 

15 adjuvants are immunomodulators which are typically non- 

covalently linked to antigens and are formulated to enhance 
the host immune responses. Thus, adjuvants have been 
identified that enhance the immune response to antigens 
delivered parenterally . Aluminum hydroxide and aluminum 

20 phosphate (collectively commonly referred to as alum) are 

routinely used as adjuvants in human and veterinary vaccines. 
The efficacy of alum in increasing antibody responses to 
diphtheria and tetanus toxoids is well established and a 
HBsAg vaccine has been adjuvanted with alum. 
. 25 Other extrinsic adjuvants may include saponins 

complexed to membrane protein antigens (immune stimulating 
complexes) , pluronic polymers with mineral oil, killed 
mycobacteria in mineral oil, Freund's complete adjuvant, 
bacterial products, such as muramyl dipeptide (MDP) and 

30 lipopolysaccharide (LPS) , as well as lipid A, and liposomes. 

International Patent Application, PCT/US95/ 09005 
incorporated herein by reference describes mutated forms of 
heat labile toxin of enterotoxigenic E. coli ("mLT") . U.S. 
Patent 5,057,540, incorporated herein by reference, describes 
35 the adjuvant, Qs21, an HPLC purified non-toxic fraction of a 
saponin from the bark of the South American tree Quiliaja 
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saponaria molina 3D-MPL is described in great Britain Patent 
2,220,211, and is incorporated herein by reference. 

U.S. Patent No. 4,855,283 granted to Lockhoff et al 
on August 8, 1989 which is incorporated herein by reference, 
5 teaches glycolipid analogues including N-glycosylamides, N- 
glycosylureas and N-glycosylcarbamates, each of which is 
substituted in the sugar residue by an amino acid, as immuno- 
modulators or adjuvants. Lockhoff reported that N- 
glycosphospholipids and glycoglycerolipids, are capable of 
10 eliciting strong immune responses in both herpes simplex 
virus vaccine and pseudorabies virus vaccine. Some 
glycolipids have been synthesized from long chain-alky lamines 
and fatty acids that are linked directly with the sugars 
through the anomeric carbon atom, to mimic the functions of 
15 the naturally occurring lipid residues. 

U.S. Patent No. 4,258,029 granted to Moloney, 
incorporated herein by reference thereto, teaches that 
octadecyl tyrosine hydrochloride (OTH) functioned as an . 
adjuvant when complexed with tetanus toxoid and formalin 
20 inactivated type I, II and III poliomyelitis virus vaccine. 
Lipidation of synthetic peptides, has also been used to 
increase their immunogenicity . 

Therefore, according to the invention, the 
immunogenic, antigenic, pharmaceutical, including vaccine, 
25 compositions comprising a HMW protein, or a fragment or 

derivative thereof or a HMW encoding nucleic acid or fragment 
thereof or vector expressing the same, may further comprise 
an adjuvant, such as, but not limited to alum, mLT, QS21 and 
all those listed above. Preferably, the adjuvant is selected 
30 from alum, LT, 3D-mPL, 6r Bacille Calmette-Guerine (BCG) and 
mutated or modified forms of the above, particularly mLT and 
LTR192G. The compositions of the present invention may also 
further comprise a suitable pharmaceutical carrier, including 
but not limited to saline, bicarbonate, dextrose or other 
35 aqueous solution. Other suitable pharmaceutical carriers are 
described in Remington's Ph^riraceut i cal Sciences, Mack 
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Publishing Company, a standard reference text in this field, 
which is incorporated herein by reference in its entirety. 

Immunogenic, antigenic and pharmaceutical, 
including vaccine, compositions may be administered in a 
5 suitable, nontoxic pharmaceutical carrier, may be comprised 
in microcapsules, and/or may be comprised in a sustained 

release implant. 

Immunogenic, antigenic and pharmaceutical, 
including vaccine, compositions may desirably be administered 
10 at several intervals in order to sustain antibody levels, 

and/ or may be used in conjunction with other bacteriocidal or 
bacteriostatic methods. 

As used herein and in the claims, "antibodies" of 
the invention may be obtained by any conventional methods 
15 known to those skilled in the art, such as but not limited to 
the methods described in Antibodies A Laboratory Manual (E. ' 
Harlow, D. Lane, Cold Spring Harbor Laboratory Press, 1989) 
which is incorporated herein by reference in its entirety. 
The term "antibodies" is intended to include all forms, such 
20 as but not limited to polyclonal, monoclonal, purified IgG, 
IgM, IgA and fragments thereof, including but not limited to 
fragments such as Fv, single chain Fv (scFv) , F(ab')2,. Fab 
fragments (Harlow and Leon, 1988, Antibody, Cold Spring 
Harbor); single chain antibodies (U.S. Patent No. 4,946,778) 
25 chimeric or humanized antibodies (Morrison et al., 1984, 

Proc. Nafl Acad. Sci. USA 81:6851); Neuberger et al., 1984, 
Nature 81:6851) and complementary determining regions (CDR) , 
(see Verhoeyen and Windust, in Molecular Immunology 2ed., by 
B.D. Hames and D.M. Glover, IRL Press, Oxoford University 
30 Press, 1996, at pp. 283-325), etc. 

In general, an animal (a wide range of vertebrate 
species can be used, the most common being mice, rats, guinea 
pig, bovine, pig, hamsters, sheep, birds and rabbits) is 
immunized with the HMW protein or nucleic acid sequence or 
35 immunogenic fragment or derivative thereof of the present 
invention in the absence or presence of an adjuvant or any 
agent that enhances the immunogen's effectiveness and boosted 
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at regular intervals. The animal serum is assayed for the 
presence of desired antibody by any convenient method. The 
serum or blood of said animal can be used as the source of 
polyclonal antibodies. 
5 For monoclonal antibodies, animals are treated as 

described above. When an acceptable antibody titre is 
detected, the, animal is euthanized and the spleen is asepti- 
cally removed for fusion. The spleen cells are mixed with a 
specifically selected immortal myeloma cell line, and the 
10 mixture is then exposed to an agent, typically polyethylene 
glycol or the like, which promotes the fusion of cells, 
under these circumstances fusion takes place in a random 
selection and a fused cell mixture together with unfused 
cells of each type is the resulting product. The myeloma 
15 cell lines that are used for fusion are specifically chosen 
such that, by the use of selection media, such as HAT: 
hypoxanthine, aminopterin, and thymidine, the only cells to 
persist in culture from the fusion mixture are. those that are 
hybrids between cells derived from the immunized donor and 
20 the myeloma cells. After fusion, the cells are diluted and 
cultured in the selective media. The culture media is 
screened for the presence of antibody having desired 
specificity towards the chosen antigen. Those cultures 
containing the antibody of choice are cloned by limiting 
25 dilution until it can be adduced that the cell culture is 
single cell in origin. 

Antigens rimmuno qftns and TmrnunoassaYS 

The HMW protein or nucleic acid encoding same, and 

30 fragments thereof are useful as an antigen or immunogen for 
the generation of anti-HMW protein antibodies or as an 
antigen in immunoassays including enzyme-linked immunosorbent 
assays (ELISA) , radioimmmunoassays (RIA) and other non-enzyme 
linked antibody binding assays or procedures known in the art 

35 for the detection of anti-bacterial, anti-Ch2an,ydia , and 
anti-HMW protein antibodies. In ELISA assays, the HMW 
protein is immobilized onto a selected surface, for example, 
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a surface capable of binding proteins such as the wells of a 
polystyrene microtiter plate. After washing to remove 
incompletely absorbed HMW protein, a nonspecific protein 
solution that is known to be antigenically neutral with 
5 regard to the test sample may be bound to the selected 

surface. This allows for blocking of nonspecific absorption 
sites on the immobilizing surface and thus reduces the 
background caused by nonspecific bindings of antisera onto 
the surface. 

10 The immobilizing surface is then contacted with a 

sample, such as clinical or biological materials, to be 
tested in a manner conducive to immune complex 
(antigen/antibody) formation. This may include diluting the 
sample with diluents, such as solutions of bovine gamma 
15 globulin (BGG) and/or phosphate buffered saline (PBS)/Tween. 
The sample is then allowed to incubate for from 2 to 4 hours, 
at temperatures such as of the order of about 20° to 37 °C. 
Following incubation, the sample-contacted surface is washed 
to remove non-immunocomplexed material. The washing 
20 procedure may include washing with a solution, such as 
PBS/Tween or a borate buffer. Following formation of 
specific immunocomplexes between the test sample and the 
bound HMW protein, and subsequent washing, the occurrence, 
and even amount, of immunocomplex formation may be determined 
. 25 by subjecting the immunocomplex to a second antibody having 
specificity for the first antibody. If the test sample is of 
human origin, the second antibody is an antibody having 
specificity for human immunoglobulins and in general IgG. 

To provide detecting means, the second antibody may 
30 have an associated activity such as an enzymatic activity 
that will generate, for example, a color development upon 
incubating with an appropriate chromogenic substrate. 
Detection may then be achieved by detecting color generation. 
Quantification may then be achieved by measuring the degree 
35 of color generation using, for example, a visible 

spectrophotometer and comparing to an appropriate standard. 
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Any other detecting means known to those skilled in the art 

are included. 

Another embodiment includes diagnostic kits 
comprising all of the essential reagents required to perform 
5 a desired immunoassay according to the present invention. 
The diagnostic kit may be presented in a commercially 
packaged form as a combination of one or more containers 
holding the necessary reagents. Such a kit may comprise HMW 
protein or nucleic acid encoding same or fragment thereof, a 

10 monoclonal or polyclonal antibody of the present invention in 
combination with several conventional kit components. 
Conventional kit components will be readily apparent to those 
skilled in the art and are disclosed in numerous 
publications, including Antibodie s A Laboratory Manual (E. 

15 Harlow, D. Lane, Cold Spring Harbor Laboratory Press, 1989) 
which is incorporated herein by reference in its entirety. 
Conventional kit components may include such items as, for 
example, microtitre plates,- buffers to maintain the pH of the 
assay mixture (such as, but not limited to Tris, HEPES, 

20 etc.), conjugated second antibodies, such as peroxidase 

conjugated anti-mouse IgG (or any anti-IgG to the animal from 
which the first antibody was derived) and the like, and other 
standard reagents . 

25 Nucleic Acids and Uses Thereof 

The nucleotide sequences of the present invention, 
including DNA and RNA and comprising a sequence encoding the 
HMW protein or a fragment or analogue thereof, may be 
synthesized using methods known in the art, such as using 

30 conventional chemical approaches or polymerase chain reaction 
(PCR) amplification using convenient pairs of oligonucleotide 
primers and ligase chain reaction using a battery of 
contiguous oligonucleotides. The sequences also allow for 
the identification and cloning of the HMW protein gene from 

35 any species of Chlamydia, for instance for screening 
chlamydial genomic libraries or expression libraries. 
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The nucleotide sequences encoding the HMW protein 
of the present invention are useful for their ability to 
selectively form duplex molecules with complementary 
stretches of other protein genes. Depending on the 
5 application, a variety of hybridization conditions may be 
employed to achieve varying sequence identities. In specific 
aspects, nucleic acids are provided which comprise a sequence 
complementary to at least 10, 15, 25, 50, 100, 200 or 250 
nucleotides of the HMW protein gene (Figure 2). In specific 
10 embodiments, nucleic acids which hybridize to an HMW protein 
nucleic acid (e.g. having sequence SEQ ID NO: 1, 23 or 24) 
under annealing conditions of low, moderate or high 
stringency conditions. 

For a high degree of selectivity, relatively 
15 stringent conditions are used to form the duplexes, such as, 
by way of example and not limitation, low salt and/or high 
temperature conditions, such as provided by 0.02 M to 0.15 M 
NaCl at temperatures of between about 50°C to 70°C. For some 
applications, less stringent hybridization conditions are 
20 required, by way of example and not limitation such a 0.15 M 
to 0.9 M salt, at temperatures ranging from between about 
20«C to 55»C. Hybridization conditions can also be rendered 
more stringent by the addition of increasing amounts of 
formamide, to destabilize the hybrid duplex. Thus, 
25 particular hybridization conditions can be readily 
manipulated, and will generally be a method of choice 
depending on the desired results. By way of example and not 
limitation, in general, convenient hybridization temperatures 
in the presence of 50% formamide are: ^2°C for a probe which 
30 is 95 to 100% homologous to the target fragment, 37 °C for 90 
to 95% homology and 32 °C for 70 to 90% homology. 

Low, moderate and high stringency conditions are 
well known to those of skill in the art, and will vary 
predictably depending on the base composition and length of 
35 the particular nucleic acid sequence and on the specific 
organism from which the nucleic acid sequence is derived. 
For guidance regarding such conditions see, for example, 
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SaiabrooH et al., 1989, Molecular cloning, A Laboratory 
Lnual, second Edition, cold Sprin, Harbor Press N.V , PP. 
" 47-9 57, and Ausubel et al., 1989, Current Protocols .n 
Llecular Biology, Green Publishing Associates and Wxley 
S interscience, N.V. which is incorporate here.n, by reference, 
in the preparation of genomic libraries, DHA 
fragments are generated, some of which will encode parts or 
tle^hole of Chlamydia HMW protein. The DNA may be cleaved 
at specific sites using various restriction enzymes. 
„ : ternatively, one may use .Hase in the presence 

to fragment the DNA, or the DNA can be physically sheared 
for example, by sonication. The DNA fragments can t^en be 
separated according to size by standard techniques, including 
but not limited to, agarose and polyacrylamide gel 

„ electrophoresis, column chromatography and sucrpse 9"^^-^ 
centrifugation. The DNA fragments can then be inserted into 
suitable vectors, including but not limited to 
cosmids, bacteriophages lambda or T„ bacmids and yeast 
artificial chromosome (YAC) . (See, for example, SambrooK 

20 al 1989, f.-^.-.,.. noning, A Laboratory Manual, 2a Ed., 
:l ; SprilgTarbor Laboratory Press, Cold Spring Harbor New 
VorK; Glover, D.M. (ed.,, 1985, m^^^^^^m^^f^^ 
jEBEoach, MKL press, Ltd., Oxford, U.K. Vol. I, '"^^ 
^^^--Ubrary may be screened by nucleic acid >>y--^---- 
„ L labeled probe (Benton and Davis, 1977, SSi^ 

Grunstein and Hogness, 1975, Pro. Natl. Ac-.-^ • "•S'' ^" 

The genomic libraries may be screened with labeled 
degenerate oligonucleotide probes corresponding to the -mo 

30 acid sequence of any peptide of HMW protein using °Pt--l 
ap^roacLs well Known in the art. In P-"-l« embodiments, 
the screening probe is a degenerate oligonucleotide that 
corresponds to the sequence of SEQ ID NO: 4. In another 
embodiment, the screening probe may be a degenerate 

35 oligonucleotide that corresponds to the sequence of SEQ 
N0:5. in an additional embodiment, any one of the 
oligonucleotides of SEQ ID NOs: 6-9, 12-14 and 18-21 are 
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as the probe. In further embodiments, any one of the 
sequences of SEQ ID NOs: 1, 10-11, 22-24 or any fragments 
thereof, or any complement of the sequence or fragments may 
be used as the probe. Any probe used preferably is 15 
5 nucleotides or longer. 

Clones in libraries with insert DNA encoding the 
HHW protein or fragments thereof will hybridize to one or 
more of the degenerate oligonucleotide probes. Hybridization 
of such oligonucleotide probes to genomic libraries are 

10 carried out using methods known in the art. For example, 
hybridization with the two above-mentioned oligonucleotide 
probes may be carried out in 2X SSC, 1.0% SDS at 50^ C and 
washed using the same conditions. 

In yet another aspect, clones of nucleotide 

15 sequences encoding a part or the entire HMW protein or HMW- 
derived polypeptides may also be obtained by screening 
Chlamydia expression libraries. For example. Chlamydia DNA 
or Chlamydia cDNA generated from RNA is isolated and random 
fragments are prepared and ligated into an expression vector 

20 (e.g., a bacteriophage, plasmid, phagemid or cosmid) such 
that the inserted sequence in the vector is capable of being 
expressed by the host cell into which the vector is then 
introduced. Various screening assays can then be used to 
select for the expressed HMW protein or HMW-derived 

25 polypeptides. In one embodiment, the various anti-HMW 
antibodies of the invention can be used to identify the 
desired clones using methods known in the art. See, for 
example, Harlow and Lane, 1988, Antibodies: A Laboratory 
Manual . Cold Spring Harbor Laboratory Press, Cold Spring 

30 Harbor, NY, Appendix IV. Clones or plaques from the library 
are brought into contact with the antibodies to identify 
those clones that bind. 

In an embodiment, colonies or plaques containing 
DNA that encodes HMW protein or HMW-derived polypeptide could 

35 be detected using DYNA Beads according to Olsvick et al., 
29th ICAAC, Houston, Tex. 1989, incorporated herein by 
reference. Anti-HMW antibodies are crosslinked to tosylated 

- 3 0 - PENY4-631696.1 



4' 



DYNA Beads M280, and these antibody-containing beads would 
then be used to adsorb to colonies or plaques expressing HMW 
protein or HMW-derived polypeptide. Colonies or plaques 
expressing HMW protein or HMW-derived polypeptide is 
V 5 identified as any of those that bind the beads • 

Alternatively, the anti-HMW antibodies can be 
nonspecifically immobilized to a suitable support, such as 
silica or Celite*^ resin. This material would then be used to 
adsorb to bacterial colonies expressing HMW protein or HMW- 
10 derived polypeptide as described in the preceding paragraph. 

In another aspect, PGR amplification may be used to 
produce substantially pure DNA encoding a part of or the 
whole of HMW protein from Chlamydia genomic DNA. 
Oligonucleotide primers, degenerate or otherwise, 
15 corresponding to known HMW protein sequences can be used as 
primers. In particular embodiments, an oligonucleotide, 
degenerate or otherwise, encoding the peptide having an amino 
acid sequence of SEQ ID NO: 2 , 3 or 15-17 or any portion 
thereof may be used as the 5' primer. For fragment examples, 
20 a 5» primer may be made from any one of the nucleotide 
sequences of SEQ ID NO: 4-7, 10, 12, 22-24 or any portion 
thereof. Nucleotide sequences, degenerate or otherwise, that 
are reverse complements of SEQ ID NO: 11, 13 or 14 may be 
used as the 3' primer. 
25 PGR can be carried out, e.g., by use of a Perkin- 

Elmer Cetus thermal cycler and Taq polymerase (Gene Amp"*) . 
One can choose to synthesize several different degenerate 
primers, for use in the PGR reactions. It is also possible 
to vary the stringency of hybridization conditions used in 
30 priming the PGR reactions, to allow for greater or lesser 
degrees of nucleotide sequence similarity between the 
degenerate primers and the corresponding sequences in 
Chlamydia DNA. After successful amplification of a segment 
of the sequence encoding HMW protein, that segment may be 
35 molecularly cloned and sequenced, and utilized as a probe to 
isolate a complete genomic clone. This, in turn, will permit 
the determination of the gene's complete nucleotide sequence, 
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the analysis of its expression, ana tne Pro--ion its 

protein product for functional ^^^f ^^^^ „,,,elc 

in a clinical diagnostic embodiment, the nu 
acid sequences of the HMW protein genes of the present 
S intention may be used in combination with an appropriate 

Indicator means, such as a label ^^^^^^^^..^^r means 

V, V. ^ Hi nation A wide variety of appropriate inaicaT. 

in ins art, including radioactive, enzymatic or 

:rr ii,ands, such ^ -^'^'^::t:r:^ 

" rn"."'i:rore ragrsreirnts.^n en.me ta, su.h 
as urease, alKaline phosphatase or peroxidase, instead of 
:adioacti;e tag may be used. In the case of ^^^^^^ 
colorimetric indicator substrates are Known which can be 

. ::icyed to :;:rfirry:rrdLrtion 

spectrophotometrically, to laentu-y v 

„ith samples containing HMW P"-^"^/:"/^— J ,enes 
Of the present invention are userux i , 

" ~ L-:^:r r:= 

serum, amniotic fluid, middle ear effusion ^^"^^'/Z^^; 

, „ urine, tears, mucus, ---—-/^ r/:/ " : acted 
i-fssues is absorbed or otherwise affixed to a 
matrix ;r surface. The fixed, single-stranded nucleic acid 
then subjected to specific hybridization with selected 

IS then suDjeui-tiu 

probes comprising the nucleic acid -^^""^ 
30 protein encoding genes or fragments °^ ^ '^^l^,^^^,^ 
■ the present ^-ntion under a s -^c^o^ -^^^^^^^^^^ 
conditions will depend on the particular 
on L particular criteria required depending on, for 
Ixample the G.C contents, type of target nucleic acid, 
3S sorrce'f nucleic acid, size of hybridization probe etc 
prilowing washing of the hybridization sur ace so as t 
remove non-specif ically bound probe molecules, specific 
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Hvbridization is detected, or even quantified, by means of 
Te abel It is preferred to select nucleic acid sequence 
:rrtions ;.icn are conserved a.onq species --^^^^^^^ 

15 bp and may be in the range 
selected probe may be at least 15 np o 

5 of about 30 to 90 bp. 

,^r,. f.<»n of tn- "t^ r.-^f-ein Gene ^„„^„1 
Plasmid vectors containing replicon and -""""^ 
sequences «hicH are derived from species comP^tib- with the 
host cell may be used for the expression of ^^^^^^^^ 
encoding the HMW protein or fragments thereof xn expression 
systems. Expression vectors contain all the necessary 
elements for the transcription and translation °* ^^e 
inserted protein coding sequence. The vector ordinarily 
IS ca^rLs a replication site, as well as marKing sequences 
which are capable cf providing phenotype selection in 
which are cap example, E. coli may be transformed 

transformed cells. For example, 

using PBR322 which contains genes for ampiciUin and 

using pt> other commercially available 

tetracycline resistance cells. Other comm y 

,0 vectors are useful, including but not limited to pZERO 
pTrc99A, PUC19, PUCIS, pKK233-3, pKXI, pCA.. -J^'^' 
pTrxFus, prastBac, pThioHis,, pTrcHis, pTrcHis2, P^^^" 
'he pla m!ds or phage, must also contain, or *° 
contain, promoters which can b. used by the host cell for 

9^ exoression of its own proteins. 

" ' in addition, phage vectors containing repliccn and 

control sequences that are compatible with: the host can be ^ 
Ted a a transforming vector in connection with f^-^^-^- 
Tor e-mple, the phage in lambda GE,^-X1 may be utilized in 
30 maHing recombinant phage vectors which can be used to 
transform host cells, such as E. coll LE392. 

promoters commonly used in recombinant DNA 
construction include the ^-lactamase (penicillinase) and 
Tactose p— systems and ot^er microbial promoters h 

3S as the .7 promoter system as ^^^-^^^^J^J^l' J;^^^^^^ 
4 952 496. Details concerning the nucleo-ciae y 
p;oro;ers are Known, enabling a sKiUed worKer to ligate them 
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functionally with genes. The particular promoter used will 
generally be matter of choice depending upon the desired 
results. 

in accordance with this invention, it is preferred 
5 to make the HMW protein by recombinant methods, particularly 
when the naturally occurring HMW protein as isolated from a 
culture of a species of Chla;nydia may include trace amounts 
of toxic materials or other contaminants. This problem can 
be avoided by using recombinant ly produced HMW protein in 
10 heterologous systems which can be isolated from the host in a 
manner to minimize contaminants in the isolated material. 
Particularly desirable hosts for expression in this regard 
include Gram positive bacteria which do not have LPS and are, 
therefore endotoxin free. Such hosts include species of 
15 Bacillus and may be particularly useful for the production of 
non-pyrogenic rHMW protein, fragments or analogues thereof. 

A variety of host-vector systems may be utilized to 
express the protein-coding sequence. These include but are 
not limited to mammalian cell systems infected with virus 
20 (e.g., vaccinia virus, adenovirus, etc.); insect cell systems 
infected with virus (e.g., baculovirus) ; microorganisms such 
as yeast containing yeast vectors, or bacteria transformed 
with bacteriophage DNA, plasmid DNA, or cosmid DNA. Hosts 
that are appropriate for expression of the HMW protein genes, 
. 25 fragments, analogues or variants thereof, may include E. 
coli, Bacillus species, Haemophilus, fungi, yeast, such as 
Saccharomyces pichia, Bordetella, or the baculovirus 
expression system may be used. Preferably, the host cell is 
a bacterium, and most preferably the bacterium is E. coli, B. 
30 suJbtills or Salmonella. 

The expression elements of vectors vary in their 
strengths and specificities. Depending on the host-vector 
system utilized, any one of a number of suitable 
transcription and translation elements may be used. In a 
35 preferred embodiment, a chimeric protein comprising HMW 

protein or HMW-derived polypeptide sequence and a pre and/or 
pro sequence of the host cell is expressed. In other 

PENY3-594263.1 



- 34 - 



preferred embodiments, a chimeric protein comprising HMW 
protein or HMW-derived polypeptide sequence fused with, for 
example, an affinity purification peptide, is expressed. In 
further preferred embodiments, a chimeric protein comprising 
5 HMW protein or HMW-derived polypeptide sequence and a useful 
immunogenic peptide or protein is expressed. In preferred 
embodiments, HMW-derived protein expressed contains a 
sequence forming either an outer-surface epitope or the 
receptor-binding domain of native HMW protein. 
10 Any method known in the art for inserting DNA 

fragments into a vector may be used to construct expression 
vectors containing a chimeric gene consisting of appropriate 
transcriptional/ translational control signals and the 
protein coding sequences. These methods may include in vitro 
15 recombinant DNA and synthetic techniques and in vivo 
recombinants (genetic recombination) . Expression of a 
nucleic acid sequence encoding HMW protein or HMW-derived 
polypeptide may be regulated by a second nucleic acid 
sequence so that the inserted sequence is expressed in a host 
20 transformed with the recombinant DNA molecule. For example, 
expression of the inserted . sequence may be controlled by any 
promoter /enhancer element known in the art. Promoters which 
may be used to control expression of inserted sequences 
include, but are not limited to the SV.40 early promoter 
25 region (Bernoist and Chambon, 1981, Nature 290:304-310), the 
promoter contained in the 3- long terminal repeat of Rous 
sarcoma virus (Yamamoto et al., 1980, Cell 22:787-797), the 
herpes thymidine kinase promoter (Wagner et al., 1981, Proc. 
Acad. Sci. U.S.A. 78:1441-1445), the regulatory 
30 sequences of the metallothionein gene (Brinster et al., 1982, 
Nature 296:39-42) for expression in animal cells; the 
promoters of /3-lactamase (Villa-Kamarof f et al., 1978, Proc. 
M.tl. Acad, -qni. U.S.A. 75:3727-3731), tac (DeBoer et al., 
1983, Prnc. Natl. An.>H . Rni . U.S.A. 80:21-25), P^, or trc for- 
35 expression in bacterial cells (see also "Useful proteins from 
recombinant bacteria" in Scientific American, 1980, 242:74- 
94); the nopaline synthetase promoter region or the 
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cauliflower mosaic virus 35S RNA promoter (Gardner et al., 
1981, TJnnl. Acids Res. 9:2871), and the promoter of the 
photosynthetic enzyme ribulose biphosphate carboxylase 
(Herrera-Estrella et al., 1984, Nature 310; 115-120) for 
5 expression implant cells; promoter elements from yeast or 
other fungi such as the Gal4 promoter, the ADC (alcohol 
dehydrogenase) promoter, PGK (phosphoglycerol kinase) 
promoter, alkaline phosphatase promoter. 

Expression vectors containing HMW protein or HMW- 
10 derived polypeptide coding sequences can be identified by 

three general approaches: (a) nucleic acid hybridization, (b) 
presence or absence of "marker" gene functions, and (c) 
expression of inserted sequences such as reactivity with 
anti-HMW antibody. In the first approach, the presence of a 
15 foreign gene inserted in an expression vector can be detected 
by nucleic acid hybridization using probes comprising 
sequences that are homologous to the inserted HMW protein or 
HMW-derived polypeptide coding sequence. In the second 
approach, the recombinant vector /host system can be 
20 identified and selected based upon the presence or absence of 
certain "marker" gene functions (e.g., thymidine kinase 
activity, resistance to antibiotics, transformation 
phenotype, occlusion body formation in baculovirus, etc.) 
caused by the insertion of foreign genes in the vector. For 
. 25 example, if the HMW protein or HMW-derived polypeptide coding 
sequence is inserted within the marker gene sequence of the 
vector, recombinants containing the insert can be identified 
by the absence of the marker gene function. In the third 
approach, recombinant expression vectors can be identified by 
3 0 assaying the foreign gene product expressed by the 

recombinant. Such assays can be based, for example,, on the 
physical or functional properties of HMW protein or HMW- 
derived polypeptide in in vitro assay systems, e.g., binding 
to an HMW ligand or receptor, or binding with anti-HMW 
35 antibodies of the invention, or the ability of the host cell 
to hemagglutinate or the ability of the cell extract to 
interfere with hemagglutination by Chlamydia. 
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once a particular recombinant DNA molecule is 
identified and isolated, several methods known in the art may 
be used to propagate it. Once a suitable host system and 
growth conditions are established, recombinant expression 
5 vectors can be propagated and prepared in quantity. As 
explained above, the expression vectors which can be used 
include, but are not limited to, the following vectors or 
their derivatives: human or animal viruses such as vaccinia 
virus or adenovirus; insect viruses such as baculovirus; 
10 yeast vectors; bacteriophage vectors (e.g., lambda), and 
plasmid and cosmid DNA vectors, to name but a few. 

in addition, a host cell strain may be chosen which 
modulates the expression of the inserted sequences, or 
modifies and processes the gene product in the specific 
15 fashion desired. Expression from certain promoters can be 
elevated in the presence of certain inducers; thus, 
expression of the genetically engineered HMW protein or HMW- 
derived HMW may be controlled. Furthermore, different host 
cells have characteristic and specific mechanisms for the 
20 translational and post-translational processing and 

modification of proteins. Appropriate cell lines or host 
systems can be chosen to ensure the desired modification and 
processing of the foreign protein expressed. 

The proteins, polypeptides, peptides, antibodies 
25 and nucleic acids of the invention are useful as reagents for 
clinical or medical diagnosis of Chlamydia infections and for 
scientific research on the properties of pathogenicity, 
virulence, and infectivity of Chlamydia, as well as host 
defense mechanisms. For example, DNA and RNA of the 
30 invention can be used as probes to identify the presence of 
Chlamydia in biological specimens by hybridization or PGR 
amplification. The DNA and RNA can also be used to identify 
other bacteria that might encode a polypeptide related to the 
Chlamydia HMW protein. The proteins of the invention may be 
35 used to prepare polyclonal and monoclonal antibodies that can 
be used to further purify compositions containing the 
proteins of the invention by affinity chromatography. The 
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p„tei„s can also .e usea in stanaara ™ 

antibodies to Chlamydia in a sample, 
for the presence of antiDoaiet. 



7. 

5 



T^ds that contain portions of the^^ 

f *.v,e aene encoding the HMW 

having the open reading frame of ^he gen 

protein of Chlamydia that are desorxbea -"^ 

Lrein have .een ^^-^ ;;::3r.ri nVrfve, KocKviXle, 
collection (RTCC) located at 12301 Par 

.0 Maryland 208=2, U.S.A.. pursuant "^^^^^f^^ J, 
pursuant to 37 CFR l.BOS and pr.or ^ I 'Zi.^ 

rr:::; prseri^re:-^^^^^^^^^ 

of tne gentito yj. ^+.^>.^on<= will become 

samples of the deposited -terxals w.l 

.5 available to the public upon grant of a 

this united Stated patent application ^ 

20 invention. Any equiv f^aaments or analogues 

similar or equivalent proteins - /^^^^^^^ ^.^^.^ ^^,3 3,,pe 
thereof as described in this application are with 

of the invention. 

Kir. vi ^^-f-. Dep osited 

• 25 Plasmid ATCC_A September 8, 1997 

PAH342 ATCC 985538 P 

" above disclosure generally describes the 

30 present invention. . .ore ^^^^^^ ^ Z:^. 
.elow in the following J..e not intended 

solely for the purpose of -l^-^^^^ ^^^^ ^,^^1 

to limit the scope of the J'^^f circumstances 

substitution Of equivalents ^^l^TTsX^^^^ terms have 
35 suggest or render expedient. Although specif 

been employed herein, such terms are --^^^ ^ ^^^^ 
descriptive sense and not for purposes of limitation. 
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Methods of molecular genetics, protein biochemistry 
and immunology used but not explicitly described in the 
disclosure and examples are amply reported in the scientific 
literature and are well within the ability of those skilled 
5 in the art. 

Exam ple 1. 

Tcoi^j-ion a rH Pnrifica t H^n nf Mature Chlamydia Protein 

McCoy cells were cultured either in standard 225cm2 
10 tissue culture flasks or in Bellco spinner flasks (Cytodex 
microcarrier, Pharmacia) at 37 °C in 5% CO, using DMEM media 
supplemented with 10% chlamydia-antibody free fetal bovine 
serum, glucose and nonessential amino acids. C. trachomatis 
L, elementary bodies (ATCC VR-902B) were prepared from lysates 
15 of infected McCoy cells. Basically, McCoy cells infected 
with C. trachomatis (LGV) were sonicated and cellular 

debris was removed by centrif ugation. The supernatant 
containing chlamydial elementary bodies (EBs) was then 
centrifuged and the pellet containing EBs was resuspended in 
20 Hanks' balanced salts solution (HBSS) . RNase/DNase solution 
was added and incubated at 37 "C for 1 hour with occasional 
mixing. The EB containing solution was layered onto a 
discontinuous density gradient (40%, 44% and 54%) of 
Angiovist 370 (mixture of diatrizoate melgumine and 
25 diatrizoate sodium, Berlex Laboratories, Wayne, NJ) and 

ultracentrifuged for separation of the EBs on the gradient. 
After centrifugation the EBs were harvested from the gradient 
between the interface of the 44% and 54% Angiovist 370 
layers. The EBs were washed in phosphate buffered saline and 

30 iresuspended in HBSS. 

Purified EBs were sequentially extracted with 0.1% 
OGP [high ionic strength] in HBSS to remove peripheral 
surface proteins and held on ice. The same EB preparation 
was then extracted with 1.0% OGP, 10 mM DTT, 1 mM PMSF, 10 mM 

35 EDTA, in a 50 mM Tris pH 7.4 buffer. Extracts were dialyzed 
(3500 MWCO) to remove detergent and other reagents and 
concentrated by lyophilization. Protein containing extracts 
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were diluted in HBSS and passed over commercially available 
heparin-sepharose columns (HiTrap Col., Pharmacia). After 
samples were applied to the heparin column nonadhered 
proteins were removed by washing with excess HBSS. Bound 
5 proteins were batch eluted with PBS containing 2M sodium 
chloride. Eluents were dialyzed extensively to remove salt 
and then lyophilized. The heparin-binding proteins were size 
fractionated by SDS-PAGE and visualized by silver staining or 
analyzed by Western blotting. Protein (s) of about 105-115 
10 KDa present in moderate amounts were detected as shown in 
Figure 1. The isoelectric point of the native HMW protein 
was determined to be about 5.95. 

To obtain one N-terminal amino acid sequence, 
sufficient quantities of the HMW protein (> 5 ug) were 
15 electroblotted onto a PVDF membrane (Applied Biosystems) , and 
stained with Coomassie blue. Immobilized HMW protein was 
released from the membrane and treated in situ with low 
levels of endopeptidase Lys-C, endopeptidase Arg-C and/or 
endopeptidase Glu-C to fragment the native protein. The 
20 resulting peptide fragments were purified by HPLC and their 
N-terminal amino acid sequences determined using an ABI 430 
Protein Sequenator and standard protein sequencing 
methodologies. The N-terminal amino acid sequence is: 



25 



E-I-M-V-P-Q-G-I-Y-D-G-E-T-L-T-V-S-F-X-Y 



and is denoted SEQ ID No.: 3. 

When a composite PDB+SwissProt+PIR+GenPept database 
(>145 K unique sequences) was searched with the HMW protein 

30 N-terminal sequence (20 residues) using rigorous match 

parameters, no precise homologies were found. Thus the HMW 
protein is a novel chlamydial protein. Since this protein 
was isolated under conditions that should release only 
peripheral membrane proteins (e.g. Omp2) , these data indicate ^ 

35 that the HMW protein is a surface-associated protein. ^ 



Example 2 



- 40 - 



PENY3-594263.1 



p^^p.^-nfinn nf Antib^^Hi^^ ho Whole Chi amydia EBS 

TO aid in the characterization of the HMW protein, 
hyperimmune rabbit antisera was raised against whole EBs from 
C. trachomatis L,. Each animal was given a total of three 
5 immunizations of about 250 ug of Chlamydia EBs per injection 
(beginning with complete Freund's adjuvant and followed with 
incomplete Freund's adjuvant) at approximately 21 day 
intervals. At each immunization, approximately half of the 
material was administered intramuscularly (i.m.) and half was 
10 injected intranodally. Fourteen days after the third 

vaccination a fourth booster of about 100 ug of EBs was given 
i.m. and the animals exsanguinated 7-10 days later. A titre 
of 1:100,000 was obtained as determined by ELISA. 

15 Bxam ple 3. 

noi-^.-r.i nati on nf Post-t ransi ati o nal modifications 

Recently, several C. trachomatis membrane- 
associated proteins have been shown to be post- 
translationally modified. The 18 kDa and 32 kDa cysteine- 

20 rich EB proteins, which are lectin-binding proteins, have 
been shown to carry specific carbohydrate moieties (Swanson, 
A F. and C.C. Kuo. 1990. Jnff^ct.' Immun,. 58:502-507). 
incorporation of radioiabelled palmitic acid has been used to 
demonstrate that the about 27 kDa C. tracho;natis Mip-Uke 

25 protein is lipidated (Lundemose, A.G., D.A. Rouch, C.W. Penn, 
and J.H. Pearce. 1993. i^Bacteriol. 175:3669-3671). Swanson 
et al. have discovered that the MOMP from the serovar 
contains N-acetylglucosamine and/ or N-acetylgalactosamine and 
these carbohydrate moieties mediate binding of MOMP to Hela 

30 cell membranes. 

To ascertain whether the HMW protein. is 
glycosylated, EBs are grown on McCoy cells in the presence of 
tritiated galactose or glucosamine, subjected to heparin 
affinity chromatography and the heparin binding proteins 

35 analyzed by SDS-PAGE and autoradiography. Briefly, McCoy 
cells are grown in T225 flasks under standard conditions 
(DMEM + 10% FCS, 35ml per flask, 10% CO,) to about 90% 
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confluency and inoculated with sufficient EBs to achieve 90%- 
100% infectivity. Following a 3 hour infection period at 
37 oc cycloheximide is added (1 ug/ml) to inhibit host cell 
protein synthesis and the cultures reincubated for an 

5 additional 4-6 hours. Approximately 0.5 mCi of tritiated 
galactose (D- [4 , 5-'H (N) ]galactose, NEN) or glucosamine (D- 
[1,6-'H(N) glucosamine, NEN) is then be added to each flask and 
the cultures allowed to incubate for an additional 30-40 
hours. Cells are harvested by scraping and EBs purified by 
10 gradient centrifugation. HMW protein is isolated from 1.0% 
OGP surface extracts by affinity chromatography, eluted with 
NaCl and analyzed by SDS-PAGE using '^C-labelled molecular 
weight markers (BRL) then subjected to autoradiography. 
Dried gels are exposed for 1-4 weeks to Kodak X-AR film at - 

15 70°C. 

To determine post synthesis lipid modification, 
C. trachomatis serovar is cultivated on monolayers of McCoy 
cells according to standard procedures. Approximately 24 
hours postinfection, conventional culture media (DMEM + 10% 
20 FCS) is removed and replaced with a serum-free medium 

containing cycloheximide (lug/ml) and [U-"C] palmitic acid 
(0.5 mCi/T225 flask, NEN) and incubated for a further 16-24 
hours to allow protein lipidation to occur. Surface EB 
extracts are prepared, hepar in-binding proteins are Isolated 
25 and analyzed by autoradiography as described above. 

The functionality of glycosylated or lipidated 
" moieties is assessed by treating whole EBs or OGP surface 
extracts with appropriate glycosidases. Following 
carbohydrate removal, extracts are subjected to affinity 
30 chromatography and SDS-PAGE to determine whether the HMW 
protein retains the ability to bind to heparin sulfate. 

Example 4. 

rionin q nf the N-t>^r-Tnina1 Sea P ^nl- of the H^fW Protein Gene 
35 Degenerate oligonucleotides were designed based on 

the N-terminal amino acid sequence of the HMW protein and 
were synthesized. These oligonucleotides were then used to 
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generate gene-specific PGR products that were employed as 
hybridization probes to screen a C. tracho;natis L, XZAPII DNA 
library to isolate the gene for the HMW protein. 

Briefly, appropriate low degeneracy peptide 
5 segments were identified from the N-terminal and internal 
amino acid sequence data by computer analysis (MacVector, 
IBI) and used to guide the design of low degeneracy sequence- 
specific oligonucleotide PGR primer sets. 

Using the N-terminal primary sequence as a guide, 
10 four degenerate oligonucleotide probes complementary to the 
first six residues of the HMW peptide E-I-M-V-P-Q (residues 
.1-6 of SEQ ID NO.: 3), and comprising all possible nucleotide 
combinations (total degeneracy = 192 individual sequences) , 
have been designed and employed as forward amplification 
15 primers. 

SEQ ID No . 4 5 ' -GAA-ATH-ATG-GTN-GCN-GAA-3 ' . 
SEQ ID No. 5 5'-GAA-ATH-ATG-GTN-CCN-CAG-3' 
SEQ ID No. 6 ■ 5'-GAG-ATH-ATG-GTN-GCN-GAA-3' 
20 SEQ ID NO. 7 5 ' -GAG-ATH-ATG-GTN-CGN-CAG-3 ' 

TWO additional oligonucleotide probes representing the 
reverse complement DNA sequence of the internal five residue 
peptide Y-D-G-E-T (residues 9-13 of SEQ ID No.: 3) , and 
, 25 comprising all possible nucleotide combinations (total 

degeneracy = 128 individual sequences) , have been designed 
. and employed as reverse amplification primers. 

SEQ ID NO. 8 5 ' -NGT-YTC-NCC-RTC-ATA-3 ' 
30 SEQ ID NO. 9 5 ' -NGT-YTC-NCC-RTC-GTA-3 ' 

Oligonucleotides were synthesized on an ABI Model 
380B DNA synthesizer using a 0.2 ^imol scale column (trityl- 
on, auto-cleavage) and standard phosphoramidite chemistry. 
35 crude oligonucleotides were manually purified over C-18 
syringe columns (OP Columns, ABI). Purity and yield were 
ascertained spectrophotometrically (230/260/280 ratios). 
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standard PGR amplification reactions (2 mM Mg^^, 200 
umol dNTPs, 0.75 units AmpliTaq, 50 ul final volume) were 
programmed using about 0.2 ug C. trachomatis DNA (about 
3X10'' copies of the HMW protein gene if single copy) and 
5 about 100 pmol of each forward (N-terminal oligo) and reverse 
(internal oligo) primer. Higher than normal concentrations 
of primers ("20 pmol/50 ul) were used for amplification, at 
least initially, in order to compensate for primer 
degeneracy. Amplification of target sequences was achieved 
10 using a standard 30-cycle, three-step thermal profile, i.e. 
95°C, 30 sec; 60»C, 45 sec, 72«C, 1 min. Amplification was 
carried out in sealed 50ul glass capillary tubes using a 
Idaho Technologies thermal cycler. To verify that the PGR 
products generated during these amplification reactions were 
15 specific for the HMW protein coding sequence and were not 
"primer-dimer" or other DNA amplification artifacts, 
amplimers were purified using silica-gel spin columns 
(QIAGEN) , cloned into the PGR cloning vector pZERO 
(StrataGene) , and subjected to direct DNA sequence analysis. 
2 0 The DNA sequence for the cloned PGR products were 

determined using conventional dideoxy-terminator sequencing 
chemistry and a modified T7 DNA polymerase (Sequenase, USB) . 
Briefly, each double stranded plasmid template was denatured 
by a brief treatment with NaOH. Following neutralization, 
25 each denatured template was used to program 4 separate 
sequencing reactions. Each reaction contained the M13 
universal forward sequencing primer (21mer) but a different 
ddNTP/dNTP termination mix (i.e. A,G,C, or T) . Termination 
products were labelled by including [a-^^SJdATP in the 
30 reaction ( -50uGi/reaction, >3000Ci/mmol , Amersham) . 

Individual extension products were denatured (formamide, 
-95«C) and subjected to high resolution denaturing 
polyacrylamide gel electrophoresis (6% acrylamide, 8M urea, 
TAE buffer, -500V, "SOmin) . Sequencing gels were then 
35 transferred to filter paper (Whatmann 3MM) , dried under 

vacuum, and then autoradiographed at -70°G for 24-72 hours. 
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Base ladders were read manually from each gel and a consensus 

sequence determined. 

HMW protein-specific amplimers suitable for library 
screening and/or Southern blotting were produced by PCR and 
5 uniformly radiolabelled during the amplification process by 
adding [a-"P]dNTPs (about 50 uCi each dNTP, Amersham, >5000 
Ci/mmol) to the reaction mixture. Labelling reactions were 
performed as above except reactions were performed in 0.5ml 
microcentrifuge tubes using a Bellco Thermal Cycler. 
10 unincorporated label and amplification primers were removed 
from the reaction mixture using centrifugal size-exclusion 
chromatography columns (BioSpin 6 columns, BioRad) . 

A highly redundant c .trachomatis serovar 1^ DNA 
library (>50,000 primary clones) has been constructed by 
15 cloning size-fractionated fragments >10 Kbp produced from a 
partial EcoRI digest of genomic DNA into the lambda cloning 
vector XZAPII (Stratagene) . Radiolabelled HMW protein- 
specific PCR products were used to screen this library for 
recombinant clones that carry all or part of the HMW protein 
20 coding sequence. Standard recombinant DNA procedures and 
methodologies were employed for these . experiments. All 
phage that hybridized with these probes were purified to 
homogeneity by sequential rounds of plating and hybridization 
screening. Once reactive phage were purified, insert- 
, 25 containing phagmids (pBluescript SK- derivatives) were 

excision-rescued from the parental phage by coinfecting host 
cells with an appropriate helper phage, e.g. R408 or VCSM13 
(Stratagene) . Individual phagmids were further purified by 
streak-plating on LB agar containing ampicillin (lOOug/ml) 
30 and selecting for individual colonies. 

TO confirm purified phagemid derivatives carried 
the HMW protein sequences, plasmid DNA was prepared and used 
to program amplification reactions containing the HMW 
protein-specific PCR primer sets. The presence of HMW 
35 protein-specific inserts was confirmed by the production of 
the appropriate sized PCR product. 
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Plasmid pAH306 is one HMW protein-containing 
derivative that was isolated by these methodologies. 



Physical Mapping of PAH306 
5 The inserts from pAH3 06 were physically mapped and 

the location of HMW protein gene determined using appropriate 
six-base restriction endonucleases (e.g. EcoRI, Hindlll, 
BamHI, PstI, Smal, Kpnl, etc.) and HMW protein coding 
sequences localized by Southern hybridization employing 
10 radiolabelled N-terminal-specif ic PGR products as probes. 
The orientation and extent of HMW protein-specific sequences 
were determined by PGR analysis using primer sets consisting 
of HMW protein-specific forward primers and reverse primers 
complementary to either the T3 or T7 promoter sequences 
15 located in the cloning vector. 

Plasmid pAH306 was determined to contain a single 
-6.6 Kbp EcoRI fragment of chlamydial origin. Directional 
PGR analysis of pAH306 demonstrated this derivative encodes 
roughly 1.5Kbp of the N-terminal region of the HMW protein 
20 gene. 

The DNA sequence for the HMW protein gene encoded 
on PAH306 was obtained for both strands via conventional 
"sequence-walking" coupled with asymmetric PGR cycle 
sequencing methodologies (ABI Prism Dye-Terminator Cycle 
25 sequencing, Perkin-Elmer ) . Sequencing reactions were 
programmed with undigested plasmid DNA ("O.Sug/rxn) as a 
template and appropriate HMW protein-specific sequencing 
primers ('3 . 5pmol/rxn) . 

in addition to the template and sequencing primer, 
30 each sequencing reaction (-20ul) contained the four different 
dNTPs (i.e. A,G,G, and T) and the four corresponding ddNTPs 
(i.e. ddA, ddG, ddG, and ddT) terminator nucleotides; with 
each terminator being conjugated to one of four different 
fluorescent dyes. Single strand sequencing elongation 
35 products were terminated at random positions along the 
template by the incorporation of the dye-labelled ddNTP 
terminators. Fluorescent dye-labelled termination products 
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vacuu., suspended xn a Tempi ^^.^^ 
Elmer), denatured at 95 c for 5mr , automated 
S resolution capillary electrophoreses on an 

OHA saquenator ^^^'^'^'"'f'^J'.^^^, £,om individual 

ONA sequence f ^^/"f ^f^^^,,, f^^o.escent peaK 
reactions were collected ^"^^'>^/"^^""p„„,,^C computer 
intensities analyzed automatically on a f""^'^"^ 

,0 :sin, ABX sequence Analysis "'-^/J^t: ^r^^^^ 
individually autoanaly.ed D.A sequences «ere 
tor accuracy before being merged into a conse 
■string" using AutoAssembler -""^^^^ ^^^^ f ' M.30e 
strands of the HHW protein gene -^-"^^^f/^^ I "^„,p„3,,e 

XS were sequenced and these data compiled to crea 

sequence for the HMW protein ^'"^^^''^^^'T^J^I; I„ Ho.: 

encoding the segment of H« P-^^ ^^^f/^, in Figure 2. 
10 and is represented by nucleotides 382 to 

a man of PAH306 is shown in Figure 5. 

A map oi y , „ ,o rr orimary amino acid 

Database analysis ^-f- ^ion sites, 
homologies, hydropathy profiles, 

junctional/conformational -^''^^"J^^^^^ protein was 
predicted amino acid sequences "^/^^ 
performed using GeneRunner and Intelligenti 
, 25 indicating the HMW protein is novel. 



Chromosome walking was employed 
30 terminal portion of the HMW P"^"" ^^/^ J, ./.^e 

KcoRl fragment distal to the --;; ";;;37;i,., sequence 
.ature HMW protein a- P--^ initial 
" - vector was - ^ ^^^^^^^^ 
Chromosome walK. uriei.iy, ^ ^^„^4-o size 

3s „ith BamHl and .coKX ^ ^rororr^^^^^^^^ - 
rbirr^ ^eTeTrer-otrKbp /amHV.coRl ... .and was 
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excised from the gel and purified using commercially 
available silica gel microcentrifuge chromatography columns 

and reagents (QIAGEN) . 

The purified 0.6Kbp B/E fragment was radiolabelled 
5 with [a-dATP] (>3000Ci/mmol, Amersham) via random-priming 
labelling methodologies employing commercially available 
reagents (Boehringer Mannheim) and used to probe Southern 
blots of C. trachomatis L, genomic DNA that had been digested 

to completion with Hindlll. 
Lo The 0.6Kbp B/E probe from pAH306 hybridized to a 

-1.4Kbp Hindlll genomic fragment. Based on the experimentally 
derived restriction map of the HMW protein gene segment 
encoded on pAH306, this fragment encodes -0.2Kbp of the C- 
terminal HMW protein sequence. 
1.5 The radiolabelled 0 . 6Kbp B/E fragment was used 

subsequently to probe a moderately redundant ("5,000 primary 
clones) C. trachomatis L2 library to identify clones that 
contain the -1.4Kbp Hindlll fragment. Briefly, C. 
trachomatis genomic DNA was digested to completion using a 
20 -10-fold excess of the restriction endonuclease Hindlll ('10 
units per lug of genomic DNA, 3T>C, 18-24 hours) . Digestion 
products were size fractionated by agarose gel 
electrophoresis (0.8% agarose, TAE) and DNA fragments ranging 
in size from "l.OKbp to 2.0Kbp were excised from the gel. 
25 Excised agarose strips contain the desired DNA fragment sizes 
were dissolved in a solubilization/binding solution 
(QXl, QIAGEN) and purified using commercially available 
silica-gel spin columns (QIAGEN). Purified l.0-2.0Kbp 
genomic Hindlll fragments were then cloned into the 
30 pBlueScript SK- plasmid which had been previously digested to 
completion with Hindlll and treated with calf intestinal 
phosphatase to prevent vector religation. 

vector/insert ligations were performed in a "SOul 
final reaction volume (50mM Tris-HCl, pH 7.00; lOmM NaCl; ImM 
35 ATP; 0.5mM DTT) at 250C for "16-24 hours using T4 DNA ligase 
("10 units/reaction) and a vector : insert molar ratio of 
approximately 1:10. Following ligation, aliquots ("50ng 
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ligated DNA) were used to electroporate a competent E.coli 
host, e.g. E.coli TOPIO. Electroporated cells were then 
plated onto LB agar containing "lOOug/inl ampicillin to select 
for plasmid-harboring clones. Approximately 1,000 plasmid- 
5 harboring Ap« transf ormants were transferred directly from LB 
Ap'"- agar plates onto nylon membranes (HyBond N+, Amersham) by 

capillary action. 

Following transfer, plates were re-incubated at 
370c to regenerate viable colonies for further manipulation. 
10 colonies transferred to membranes were lysed and DNA 

liberated by treating the colony blots with a denaturing 
SDS/NaOH solution. A Tris buffered NaCl solution was used to 
neutralize and stabilize lysis material. Released DNA was 
immobilized onto the membranes by UV irradiation. Standard 
15 recombinant DNA procedures and methodologies were employed to 
probe the colony blots with the radiolabelled 0.6Kbp B/E 
fragment and identify recombinant derivatives which carry the 
desired "l.AKbp Hindlll fragment. 

Plasmid PAH310 was one derivative isolated by these 
20 procedures and the coding segment of the HMW protein is 
represented by nucleotides 994-2401 in Figure. 2. 

Restriction analysis using Hindlll and EcoRI, 
individually and in combination, together with DNA sequence 
analysis of purified plasmid DNA confirmed pAHSlO encodes the 
25 expected -1.4Kbp Hindlll fragment. These analyses also 
demonstrated that the -1.4Kbp insert consists of the same 
'1.2Kbp Hindlll-EcoRI fragment that is present in pAH306 and 
a unique -0.2Kbp EcoRI -Hindlll fragment that encodes C- 
terminal HMW protein-specific DNA. 
30 The -0.2Kbp EcoRI-Hindlll (E/H) fragment was chosen 

as the probe for the second chromosome walk. Briefly, pAH310 
was digested to completion with EcoRI and Hindlll and the 
digestion products size fractionated by agarose gel 
electrophoresis (0.8% agarose in TAE buffer). The desired 
35 -0.2Kbp (E/H) band was excised from the gel, purified, 
radiolabelled with [a-P'^]dATP, and used as a probe to 
identify clones in the original c. trachomatis L^ XZAPII 
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genomic library that encode the C-terminal segment of the HMW 

protein gene. 

Plasmid pAH316 is one derivative isolated by these 
procedures. Restriction analysis of pAH316 demonstrated that 
5 this derivative contains a C. trachomatis 1^ insert of "A.SKbp 
which consists of two EcoRI fragments of ~2.5Kbp and "2.0Kbp 
in size. Southern hybridization analysis using the '0.2Kbp 
E/H fragment as a probe localized this sequence to the 
-2.5Kbp EcoRI fragment of pAH316. Directional PGR analyses 

10 employing purified pAH316 plasmid DNA as a template and 

amplification primer sets specific for -0.2Kbp E/H fragment 
and T3 and T7 vector sequences demonstrated pAH316 encodes 
the C-terminal segment of the HMW protein gene. The coding 
segment of the HMW protein is represented by nucleotides 1974 

15 to 3420 in Figure 2, and is listed as SEQ ID No.:ll. 



Example 6. 

Production of Truncated HMW Rec ombinant Protein 

The N-terminal half of the HMW protein was PGR 

20 cloned as a "l.SKbp fragment into the commercially available 
E.coli expression plasmid. pTrcHisB (Invitrogen) . The forward 
primer used in these reactions was designated 140FXHO 
(57mer) , listed as SEQ ID No. 18, and contains sequences 
complementary to the first 10 N-terminal residues of the 

25 mature HMW protein. In addition to the HMW proteincoding 
sequences, this forward primer also carries a unique Xhol 
restriction site located optimally located upstream of the 
first residue of the mature HMW protein (Glu/E) for proper 
fusion to the (His)^ affinity purification domain encoded on 

30 the vector plasmid, and 5' terminal 6 base G/C clamp for 
effective amplification and a 12 base internal spacer for 
effective endonuclease recognition and digestion. 

SEP ID No. 18 5 - AAG-GGG-CCA-ATT-AGG-CAG-AGC-TCG-AGA-GAA- 
35 ATT-ATG-GTT-CCT-CAA-GGA-ATT-TAC-GAT - 3 ' 



SEP ID No. 19 5' - CGC-TCT-AGA-ACT-AGT-GGA-TC - 3' 
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The commercially available reverse sequencing primer SK 
(20mer, StrataGene) , sfo td No. 19 . which is complementary to 
phagemid sequences downstream of the EcoRI site in pAH306, 
was used as the reverse amplification primer in these 
5 reactions. To obtain acceptable yields of the HMW protein ORF 
product (-1.5Kbp), PGR amplification was performed using a 
mixture of thermostable DNA polymerases consisting of T. 
thermophilus DNA polymerase (Advantage Polymerase) , as the 
primary amplification polymerase and a minor amount of a 
10 second high fidelity thermostable DNA polymerase to provide 
additional 5' - 3' proofreading activity (CloneTech) . - An 
anti-Tth DNA polymerase antibody was added to the reaction 
mixture to provide automatic "hot-start" conditions which 
foster the production of large >2Kbp amplimers. pAH306 
15 plasmid DNA purified using a commercially available 

alkaline/SDS system (QIAGEN) and silica gel spin columns 
(QIAGEN) was used to program these amplification reactions 
("0.2ng/reac!tion) . 

The '1.5Kbp amplimer was purified from 
20 unincorporated primers using silica gel spin columns and 

digested to completion using an excess of Xhol and EcoRI ('10 
units per lug DNA) . The purified and digested N-terminal 
truncated HMW protein ORF was then be cloned into the 
commercially available expression plasmid pTrcHisB that had 
. 25 been previously digested with both Xhol and EcoRI (5:1, 

insert:vector ratio). Aliquots from the ligation reaction 
were then be used to electrotransf orm a suitable E.coli host 

(e.g. TOPIC) . 

Mini-prep DNA from ampicillin-resistant 

30 transformants picked at random were prepared, digested to 
completion with Xhol, EcoRI, or a combination of both and 
examined for the presence and orientation of the "1.5 Kbp 
truncated HMW protein ORF insert by agarose gel 
electrophoresis. Mini-prep DNA from clones determined to 

35 carry the "l.SKbp XhoI/EcoRI insert was prepared and used to 
program asymmetric PGR DNA sequencing reactions to confirm 
the fidelity of the junction formed between the HMW protein 
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fragment and the (His)^ affinity purification domain of . the 
expression vector. Plasmid pJJ36-J was one recombinant 
derivative isolated by these procedures and is represented by 
nucleotides 446 to 1977 in figure 2. The deduced amino acid 
5 sequence of the truncated fragment of HMW protein is 
represented by amino acids 29 to 532 in Figure 3 and is 
listed as SEQ ID No. 17. 

Example 7. 

10 n^termination of Presenc p- in Other Species 

Polymerase chain reaction analyses were undertaken 
to establish the presence of the HMW gene in several 
clinically recognized C. trachomatis strains and as well as 
other chlamydial species, e.g., C. pneumoniae. Chlamydia 
15 trachomatis strains as frozen stocks from the ATCC 

(Rockville, MD) were used to infect subconfluent monolayers 
(about 80%) of McCoy cells according to standard procedures. 
Infected monolayers were either centrifuged in a Sorvall 
RT6000B centrifuge ("1,300 rpm, 25°C, 30min) and/or treated 
20 with dextran sulfate ("50 ug/ml) at the time of infection to 
enhance initial attachment of the low infectivity biovars 
(non-LGV) to host cells and thus increase the final EB yield. 
Roughly 48 hours later, infected monolayers were collected by 
scraping and host cells disrupted by sonication to release 
25 elementary bodies (EEs) . Total DNA was extracted from 

purified EEs (-10''-10») of each strain using the proteinase K/ 
Nonidet P40 method described by Denamur, et al., J. Gen. 
Microbiol. 137:2525-2530 (1991), incorporated herein by 
reference, and further purified by phenol /chloroform 
30 extraction and salt precipitation. Purified Chlamydia 

pneumoniae (AR-139) genomic DNA was purchased from Advanced 
Biotechnologies Inc. 

To determine the presence of the HMW protein gene 
in these strains, amplification reactions were programmed 
35 using total Chlamydia DNA as template and the HMW protein 
segment-specific oligonucleotide primer (21mers) sets listed 
below. 
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^^^^ 5- - ^TG-GTT-CCT-CAA-GGA-ATT-TAC-0_- 3' 
„ T„ 2, 5' - GGT-CCC-CCA-TCA-GCG-GGA-G - 3 
^"^^"^^..X, stanaar. PCR a.plUication reactions (2 
lOO U.01 aNTPs, 0.75 units AMpliTag polymerase 50 ul 

. 1. voiu.e> were ^^^^ ^:zt::z:^^ 

oruae c.tracho»,atls DNA extracts ( lOul of 

avaiXa.ie C. p— ae OKA, ^ ^ / ^^^^ in 

reverse HMW protein-specific amplification p 
h! 2 ana 2X. Amplification of small target sequences 
XO 2K.P, was aCievea usin. a 3a-cycle, three-step tnerm 1 
profile, i.e. 95-C, 30 sec; 60-C, 30 sec 72 c. ^ J- 
^plification of longer target -^-""^//^j;:,: I^^^'in an 
se^uencin, was carriea ^^-^^ Tth/Vent OKA 

identical fashion except that a MAJd ma 

Uvmerase en.vme combination ^ -^useft"^^^^ 

Clontech) ana a 72-C extension time was usea 

size of the aesired PCR product plus 2 min (i.e. desired 

-^-^ = ::L:rn:ro:a:Tnd\:nra'i;tance PC. were 

.„ carried out usin, 0.2ml -----/^-f^cLr (PerKin- 
microcentrifuge tubes in an ABI 2400 Therm » 
.l.er, . Following thermal ^Vdin.- aliguots 0 1> of 
tactions were analyzea ana PCH P^-^^^f ^"'^ ^ f.^^,, 

standard agarose gal ^l-^-P-^""";^;" " ^;;results showed 
^ „ buffer) and ethidium bromide staining. The 

that the HMW protein is highly conserved in ^^^^^^^^ 

relevant serovars,- the HMW gene was P«-"^ ^ \ J 

sample, strains tested, including serovars BBa, D. E, 

H, I, J, K, L,. 1, and MoPn and in C. pne»oniae. 

30 

Bvam ple 8. 

DetermiJiaUm-Of^^aue^^ 

^7^;;;;;u;;r^h7degree of DNA and amino acid 

« ^iffprent Chlamydia strains, tne 
sequence variation among different cnx y 

3S gene for the HMW protein was PCH-clonea - ^ 

trachomatis B serovar <-P--"-"j;";;;;„:r , presenting 
organisms) ana from a c. trachor,atis F serovar 
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the low infect ivity STD biovars) and compared to the HMW 
protein consensus C. trachomatis Lj sequence. 

Briefly, LD-PCR was used to generate "eKbp HMW 
protein-specific DNA fragments from C. trachomatis B and F 
5 genomic DNA that contain the complete coding sequence for the 
mature HMW protein. Amplification conditions for these LD- 
PCR exercises were as described in Example 6. The reverse 
amplification primer employed in these reactions (p316Kpn-RC, 
56mer), listed as SEQ ID No. 13, is complementary to a 
10 sequence located "SKbp downstream of the predicted HMW 

protein termination codon. As an aide to cloning the desired 
-6Kbp amplimer, a single Kpnl restriction endonuclease site 
5' to the chlamydial sequence was engineered into the 
p316Kpn-RC primer. The forward amplification primer used for 
15 these reactions (p306Kpn-F, 56mer) , listed as SEQ ID No. 12, 
contains the sequence complementary to the first 10 amino 
acid residues (30 nucleotides) specifying the mature HMW 
protein as well as a 5 ' sequence specifying a Kpnl site. 
p306Kpn-F was designed such that the sequence encoding the N- 
20 terminus of the mature HMW protein could be linked in-frame 
to a hexa-His affinity domain encoded downstream of the 
highly efficient trc promoter on the E.coli expression vector 
pTrcHisB (ClonTech) when the "SKbp amplimer was inserted into 
the Kpnl site of this vector. 



25 



fiKO TP NO. 12 5 • -AAG-GGC-CCA-ATT-ACG-CAG-AGG-GTA-CCG-AAA- 
TTA-TGG-TTC-CTC-AAG-GAA-TTT-ACG-AT-3 ' 

SKO TP NO. 13 5 ' -AAG-GGC-CCA-ATT-ACG-CAG-AGG-GTA-CCC-TAA- 
30 GAA-GAA-GGC-ATG-CCG-TGC-TAG-CGG-AG- 3' 

The -6 Kbp HMW protein products were purified using silica- 
gel spin columns (QIAGEN) and the fragments subjected to two 
8-10 hour cycles of Kpnl digestion using a 10-fold excess of 
35 Kpnl (-10 units per 1 ug of purified fragment, 37°C) . 

Following the second digestion, residual restriction enzyme 
activity was removed using QIAGEN spin columns and the "6 Kbp 
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Kpnl HMW protein fragments cloned into the pTrcHisB plasmxd 
which had been previously digested to completion with Kpnl 
and treated with calf intestinal phosphatase to prevent 

vector religation. 
5 vector/ insert ligations were performed in a 'SOul 

final reaction volume (50mM Tris-HCl, pH 7.00; lOmM NaCl; ImM 
ATP; 0.5mM DTT) at 25''C for "2 hours using T4 DNA ligase ( 10 
units/reaction) and a vector : insert molar ratio of 
approximately 1:5. Following ligation, aliquots ("SOng 
10 ligated DNA) was used to electroporate a competent E.colz 
host, e.g. E.coli TOPIO. Plasmid-harboring transf ormants 
were selected by plating electrotransf ormed cells onto LB 
agar containing 100 ug/ml ampicillin. Ampicillin-resistant 
(Ap'^) transformants appearing after a "18-24 hour incubatxon 
15 period at 37 "C were picked at random and restreaked onto the 
same selective media for purification. 

A single, purified Ap'^ colony from each initial 
transf ormant was used to inoculate "Sml of LB broth and grown 
overnight at 37«>C in a incubator shaker with mild aeration 
20 (-200 rpm). cells from broth cultures were harvested by 
centrifugation and used to prepare small quantities of 
plasmid DNA. commercially available reagents (QIAGEN Plasmxd 
Mini Kits) were employed for these plasmid extractions. 
Plasmid derivatives carrying inserts were presumptively 
25 identified by electrophoresing the non-digested plasmid DNA 
in agarose gels (0.8% agarose in TAE buffer) and identifying 
derivatives greater in size than vector plasmid. Insert- 
containing derivatives were confirmed and the orientation of 
the HMW protein inserts relative to- vector sequences were 
30 determined using appropriate restriction endonucleases (Kpnl, 
EcoRI, Hindlll, SamHI, etc.), either separately or together 
in various combinations. 

The DNA sequence of the C. trachomatis B and F HMW 
protein genes were obtained for both strands using "sequence 
35 walking" the asymmetric dye-terminator PCR cycle sequencing 
methodology (ABI Prism Dye-Terminator Cycle Sequencing, 
Perkin-Elmer) described in Example 4. Reactions were 
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programmed with plasmid mini-prep DNA and individual HMW 
protein sequence-specific primers that were employed in the 
sequencing of the HMW protein gene from the L, type straxn. 

DNA sequence data were collected using the ABI 310 
5 sequenator and analyzed automatically on a PowerMAC computer 
and appropriate computer software as described in Example 4. 
individually autoanalyzed DNA sequences were edited manually 
for accuracy before being merged into a consensus sequence 
"string" using AutoAssembler software (Perkin-Elmer) . Both 
10 strands of the HMW protein gene from the C. trachojnatis B and 
F serovars were sequenced and these data compiled to create 
composite consensus sequences for both the C. trachon.atis B 
and F HMW protein genes. These sequences are listed as SEQ 
ID Nos.: 14 and 15. Sequence comparisons of the I^, F and B 
15 strains are presented in Figure 6. 

Tixam ple 9. 

p^orinnhion of Rp-combir '?^nt-. Protein 

TO produce sufficient quantities of recombinant HMW 
20 protein for both immunogenic ity and animal protection 
studies, the HMW gene has been PGR cloned into suitable 
E.coli and baculovirus expression systems. Large quantities 
of rHMW protein are produced in an E.coli - based system as a 
chimeric fusion protein containing an N-terminal (His), 
25 affinity purification domain. The complete HMW protein open 
reading frame (ORF) was PCR-cloned from the C. tracho;natas 
genome as a single Kpnl fragment and fused in the proper 
orientation and in the correct reading frame to the (His), 
affinity purification domain encoded on the high expression 
30 plasmid vector pTrcHisB (CloneTech) as described in Example 

The (His), affinity purification domain is part of a 
high expression locus consisting of the highly efficient tac 
promoter (IPTG-inducible) and consensus Shine and Delgarno 
35 ribosome binding site (RES) located immediately upstream of 
the (His), affinity purification domain. The HMW protein 
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genes from C. trachomatis LGV h,, C. trachomatis B, and 
C.trachomatis F were PGR cloned as "S-OKbp fragments. The 
forward primer (56mer) used in these reactions was designated 
p306Kpn-F and contains sequences complementary to the first 
5 10 N-terminal amino acid residues of the mature HMW protein, 
listed as SEQ ID No 12. In addition to the HMW protein 
coding sequences, this forward primer also carries a unique 
Kpnl restriction site located optimally located upstream of 
the first residue of the mature HMW protein (Glu) for proper 
10 fusion to the (His), affinity purification domain encoded on 
the vector plasmid, and 5' terminal 6 base G/C clamp for 
effective amplification and a 12 base internal spacer for 
effective endonuclease recognition/digestion. The reverse 
PGR primer, designated p316Kpn-3RC, contains a reverse 
15 complement sequence to a C. trachomatis sequence located 
-0.2Kbp downstream of the HMW protein termination codon, 
listed as SEQ ID No. 14. As with p306Kpn-F, the reverse 
primer also contains a Kpnl restriction site 5' to the C. 
trachomatis sequences, a 6 base G/G clamp, and a 12 base 

20 internal spacer. 

TO obtain acceptable yields of the HMW protein ORF 
product (about 3,500bp), PGR amplification was performed 
using a mixture of thermostable DNA polymerases consisting of 
T. thermophilus DNA polymerase as the primary amplification 
. 25 polymerase and a minor amount of a second high fidelity 
thermostable DNA polymerase to provide additional 5' - 3' 
proofreading activity (Advantage Polymerase, GloneTech) . An 
anti-Tth DNA polymerase antibody was added to the reaction 
mixture to provide automatic "hot-start" conditions which 
30 foster the production of large (>2Kbp) amplimers. 

Genomic DNA from the various C.trachomatis strains 
was isolated from EBs as described in the example above and 
used to program these reactions. Following amplification, 
the desired reaction products were purified from excess 
35 primers using commercially available silica-gel spin columns 
and reagents (QIAGEN) and digested to completion with an 
excess of Kpnl ("10 units per lug DNA). The purified and 
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digested Kpnl HMW protein ORF was then be cloned into the 
Kpnl predigested pTrcHisB expression plasinid (5:1, 
insert: vector ratio) . Aliquots from the ligation reaction 
were then used to electrotransf orm a suitable E.coli host 

5 (e.g. TOPIO) . 

Mini-prep DNA from ampicillin-resistant 
transformants picked at random were prepared, digested to 
completion with Kpnl, Hindlll, or a combination of both and 
examined for the presence and orientation of the "3.2 Kbp HMW 
10 protein ORF insert by agarose gel electrophoresis and 

ethidium bromide staining. Mini-prep DNA was used to program 
asymmetric PCR DNA sequencing reactions as described in 
example (s) above to confirm the fidelity of the junction 
formed between the HMW protein fragment and the (His)^ 
15 affinity purification domain of the vector. Plasmid pAH342 
was one derivative isolated by these procedures, which 
contains the HMW protein gene ORF from C. trachomatis and 
is represented by nucleotides 446 to 3421 in Figure 2. 

Recombinants were grown in 2X-YT broth containing 
20 lOOug/ml Ap to mid- log phase ("0.5 O-D.^oo) and induced with 
IPTG (ImM final) for an additional 4-5 hours to activate 
transcription from the vectors trc promoter. Cells were 
harvested by centrifugation and crude cell lysates prepared 
by lysis using a French pressure cell. 
25 Alternatively, expression of rHMW protein may be 

obtained by using a baculovirus expression system. Here, the 
HMW protein ORF from C. trachomatis and C .trachomatis F were 
PCR-cloned as -3Kbp PCR products into a baculovirus transfer 
vector (e.g. pFastBacHTb) that had been previously digested 
30 to completion with Kpnl and treated with CIP to minimize 

vector religation in essentially the same manner as described 
for pTrcHisB. The HMW protein expression cartridge generated 
in this cloning exercise (i.e. the baculovirus polyhedron 
promoter, N-terminal (His)6 affinity purification domain, HMW 
35 protein gene ORF) was then transferred to the baculovirus 
genome by site-specific transposition using a commercially 
available bacmid system (Bac-to-Bac, Gibco) 
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Briefly, the HMW protein baculovirus expression 
plasmid was used to transform competent E. coli DHlObac 
(Gibco) cells containing a bacmid (a hybrid baculovirus- 
plasmid replicon) to gentamicin resistance using standard 
5 transformation and selection methodologies. Transformants 
where the HMW protein expression cartridge had successfully 
transposed from the expression plasmid to the appropriate 
receptor site within the lacZ gene located on the bacmid 
replicon were identified using a standard IPTG/X-gal blue- 

10 white selection. 

White, Gm« transformants were picked at random and 
restreaked for purification. Bacmid DNA was prepared from 
broth cultures by the method of Ish-Horowitz, N. A. R. 
9:2989-2993 (1981) incorporated herein by reference, and is 

15 used to transfect Spodoptera frugiperda 9 cells. Following 
■ plaque purification, recombinant HMW protein baculovirus is 
used to infect large scale Spodoptera suspension cultures. A 
yeast expression system is used to generate a glycosylated 
form of HMW protein. 

20 

Bxam ple 10. 

Purification of Recombi nant Protein 

Recombinant HMW protein was purified to homogeneity 
using standard preparative immobilized metal affinity 
25 chromatography (IMAC) procedures. Briefly, an E. coli strain 
harboring an expression plasmid containing HMW protein gene 
was grown in Luria broth in a .51 fermenter (New Brunswick) at 
370c with moderate aeration until mid-log phase ('0.5 O.D.eoo) 
and induced with IPTG (ImM final) for 4-5 hours. Cell paste 
30 was collected, washed in PBS and stored at -20«'C. Aliquots 
of frozen cell paste ("g-lOg wet weight) were suspended in 
-120ml of D-PBS by mechanical agitation and lysed by passage 
through a French pressure cell (2X, 14,000psi, 40C) . Soluble 
protein was then removed from rHMW protein inclusion bodies 
35 by high speed centrif ugation (-10,000Xg, 4«'C, 3 0min) . 

The insoluble pellet containing rHMW protein was 
suspended in -20ml of ice cold D-PBS by homogenization and 
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centrifuged as above. Washed rHMW protein inclusion bodies 
were then denatured by suspension in a sodium phosphate 
buffer (O.IM, pH7.0) containing 6M guanidine hydrochloride 
and loaded onto a Ni^^-af f inity column (1.5cm X 25cm, bed 
5 volume -IBxal) prepared from Fast-Flow Chelating Sepharose 
(Pharmacia) and charged with Ni^'^or Zn^* ions by standard 
procedures. Unbound material was removed by washing the 
column with "5-10 column volumes of a sodium phosphate buffer 
(O.IM, pH7.0) containing 8M urea. 
10 Recombinant HMW protein bound to the affinity resin 

by virtue' of the N-terminal (His)^ affinity purification 
domain was eluted using a pH 4.0 sodium phosphate/8M urea 
buffer (-20ml). Eluted material was neutralized by the 
addition of l.OM Tris-HCl (-2.5ml, pH 7.5) and dialyzed 
15 against TE buffer containing SDS (0.5%) to remove the urea. 
Dialyzed material was concentrated using a Centricon-30 
centrifugal concentrator (Amicon, 30,000 MWCO) and mixed with 
a 1/5 volume of 5X SDS gel sample buffer containing ImM 2- 
mercaptoethanol (Lammeli) and boiled at 100°C for 5min. 
20 Samples were loaded onto Tris/glycine preparative 

acrylamide gels (4% stacking gel, 12% resolving gel, 30:0.8 
acrylamide: bis solution, 3mm thickness) . A prestained 
molecular weight standard (SeeBlue, Novex) was run in 
parallel with the rHMW protein samples to identify size 
25 fractions on the gel. The area of the gel containing 

proteins having molecular masses of "50-70 Kdal was excised 
and the proteins electroeluted using an Elu-Trap device and 
membranes (S&S) as specified by the manufacturer. 
Electroeluted protein was dialyized to remove SDS. The 
30 protein concentration of the sample was determined using a 
Micro-BCA system (Pierce) and BSA as a concentration 
standard. The purity of rHMW protein was determined using 
conventional SDS-PAGE and commercially available silver 
staining reagents (Silver Stain Plus, Novex) as shown in 
35 Figure 4. 

The apparent molecular weight of the isolated 
mature rHMW is about 105-115 kDa as determined by SDS-PAGE. 
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Examol 11. 

Preparation of Antibodies to HMW Protein 

Polyvalent antisera directed against the HMW 
protein were generated by vaccinating rabbits with the 
5 purified HMW protein or fragments thereof. Each animal was 
given a total of three immunizations of about 250 ug HMW 
protein or fragment thereof per injection (beginning with 
complete Freund's adjuvant and followed with incomplete 
Freund's adjuvant) at approximately 21 day intervals. At 
10 each immunization, approximately half of the material was 
administered intramuscullarly (i.m.) and half was injected 
intranodally. Fourteen days after the third vaccination a 
fourth booster of about 100 ug HMW protein was given i.m. and 
the animals exsanguinated 7-10 days later. Anti-HMW protein 
15 titers were measured by ELISA using purified HMW protein (1.0 
ug/well) or C. trachomatis Lj EBs (whole and crude protein 
extracts) as capture ligands. Immunogen specific IgG ELISA 
titres of 1/320,000 were observed using purified rHMW 
truncated protein and 1/2500 using either EBs or RBs. 
20 Serial dilutions of antisera were made in PBS and 

tested by ELISA in duplicate; Goat HRP-conjugated. anti- 
rabbit antibody diluted 1/1000 was used as the second 
reporter antibody in these assays. Titers are expressed as 
the greatest dilution showing a positive ELISA reaction, i.e. 
. 25 an O.D.450 value >2SD above the mean negative control value 
(prebleed rabbit sera) . Hyperimmune antisera was then used 
to probe Western blots of crude EB or RB extracts as well as 
1.0% OOP EB extract preparations to determine whether other 
C. trachomatis serovars and Chlamydia species express the HMW 
30 protein. C. trachomatis serovars B, F, L,, MoPn and Chlamydia 
pneumoniae were tested and found to have a protein of an 
apparent molecular weight of 105-115 KDa reactive with 
antisera generated against HMW protein. 
Example 12 

35 Surface localization of the HMW protein on 

different Chlamydia strains and derivatives were examined by 
indirect fluorescence antibody (IFA) . IFA was performed 
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using the procedures generally known in the art using 
hyperimmune anti-HMW protein as the primary antibody. Hak 
cells infected with whole EBs from one of C. trachomatis 
serovars Lj, B, and F, C. pneumoniae or C. psittaci are 
5 achieved by the following method. 

McCoy or Hak cells were grown to confluence in D- 
MEM media on 12mmm plain coverslips inside 24 well tissue 
culture plates then centrifugally inoculated with "5X10"^ 
inclusion forming units (IFU) of the C. trachomatis strain 

10 Nil (serover F) . After "24 hours incubation, the culture 

media was removed and infected cells fixed inmethanol for 10 
min. The fixed monoloayer was then washed with PBS (IX) to 
remove fixative and overlayer with >300 ul of anti-60Kdal 
truncated HMWP rabbit antibody that had been diluted "1/100 

15 in PBS. After 1 hour incubation with the primary antibody, 
the cells were washed gently with PBS then incubated for "30 
min. with a 1/200 dilution of a mouse anti-rabbit IgG 
antibody conjugated with FITC. The second antibody was 
diluted using a PBS solution containing 0.0091% Evans Blue as 

20 a counter stain to visualize the monolayer. Cells were 
washed 2X in PBS to remove the secondary antibody, the 
coverslips removed from the culture plates, and mounted onto 
microscope slides using a fluorescent mounting medium. 

Identical cell samples stained with prebleed rabbit 

25 antibody or FITC-con jugated second antibody alone were 
processed in parallel and served as antibody specificity 
(negative) controls. Counter stained samples were examined at 
a 1000-X magnification with a Zeisis Axioskop photomicroscope 
equipped with plan-neoflur objectives. Results using C. 

30 trachomatis Nil (F serovar) are shown in Figure 7. The 
results show that enhanced fluorescence of samples stained 
with HMW protein antibody compared to the controls confirmed 
the surface location of the HMW protein. Furthermore, 
fluorescence of samples stained with HMW protein antibodies 

35 show binding to surface localized HMW protein from I^, B and 
MoPn serovars and C. pneuomoniae. 
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9. UTILITY 

The in vitro neutralization model has been used to 
show that protective antiserum inhibited chlamydial infection 
(neutralization) of various tissue culture cell lines. 
5 Animal models are also essential for testing vaccine efficacy 
with both small animal (non-primate) and primate models 
necessary for preclinical evaluation. The guinea-pig has 
been used for studying experimental ocular and genital 
infection by the Guinea-pig inclusion conjunctivitis agent 

10 (GPIC) , C. psittaci. 

The mouse offers a consistent and reproducible 
model of genital tract infection using human genital tract 
isolates. This mouse model is a generally accepted pre- 
clinical assay, and was used to evaluate MOMP as a subunit 

15 vaccine. Another model is known as the primate model of 

trachoma infection wherein the induction of secretory IgA was 
shown to be a prime component of protection. Vaccinogenic 
ability of new subunit antigen candidates is determined using 
the above-mentioned generally accepted in vitro 

20 neutralization and animal model systems. 
Wvampl^ 13. 

In Vitro Neutralization Model 

As a preliminary exercise to the animal protection 
studies, hyperimmune ant i-HMW antibody was evaluated for its 

25 ability to block the infectivity of various C .trachomatis 
serovars (e.g. L2,B,E) in vitro. Although Hela cells were 
used to propagate Chlamydia, these cells also allow antibody- 
mediated uptake via Fc receptors. Therefore, to evaluate 
anti-HMW antibody inhibition of infectivity, Hak cells, which 

30 do not display Fc receptors, were used in these analyses. 

Cells were grown on covers lips in 2 4 -well plates to 
a subconfluent monolayer (about 90% confluency = 1X10* 
cells/ml) at 37 °C in 5% COj. Anti-HMW-antibody was diluted to 
about 100 ug/ml (total protein) in sucrose-phosphate- 

35 glutamate (SPG) buffer and then serially diluted in SPG 

buffer. Frozen aliquots of pretitered Chlamydia was diluted 
in SPG buffer to about 2X10^ IFU/ml. EBs were premixed with 
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the diluted anti-HMW-antibody to about 10-20 IFU/ul and 
incubated 30 minutes at 37 °C on a rocking platform. 

Prepared Hak cells were washed in HBSS and then 
incubated with the anti-HMW-antibody/Chlamydia EB mixture in 
5 triplicate for each antibody using 500 IFU/ml. Plates were 
incubated for 2 hours at 37<>C, then the inoculum removed and 
plates washed 3 times with HBSS. Tissue culture media 
containing 1 ug/ml of cyclohexamide was added and plates 
incubated for about 24-36 hours at 37°C in 5% COj to allow 

10 inclusion bodies to develop. After incubation, the media was 
removed and cell monolayers washed 3X in PBS. Plates were 
fixed in methanol for 20 minutes and re-washed in PBS. 

Cells were stained to visualize inclusions by 
incubating with anti-Chlamydia LPS antibody (diluted about 

15 1:500, ViroStat) , cells washed 3 times in PBS, followed by 
incubation with FITC-conjugated goat secondary antibody for 
30 minutes at 37°C. Coverslips were washed, air dried, and 
mounted in glycerol on glass coverslips. Inclusions were 
counted in five fields through the midline of the coverslip 

20 on a Zeiss fluorescence photomicroscope. Results are 

reported as the percent reduction of inclusion-containing 
cells with respect to a heterogenous antibody control (rabbit 
prebleed sera) . 

25 Example 14. 

Mouse Genital Infectivitv Model 

HMW protein is evaluated as an immunogen and a 
vaccinogen using the generally accepted mouse C .trachomatis 
genital infect ivity model. HMW protein is evaluated as an 

30 immunogen and for the ability to protect BALB/c mice against 
challenge with various C. trachomatis serovars (Lj/ B, E) . 
HMW protein is administered to groups of Chlamydia-f ree 
animals by three different immunization routes: oral, nasal 
and subcutaneous. For each route, the immunogenicity of HMW 

35 protein is determined for the protein alone and in 

combination with an appropriate adjuvant (s). After the first 
immunization, animals are periodically sacrificed and serum 
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IgG and mucosal (cervix/vagina and intestinal) sIgA levels 
determined using well known methodologies. 
Immunization of Mice; Six-to-eight week old (sexually 
mature) , specific-pathogen free, female mice are administered 
5 with the HMW protein as described below. , 

For parenteral administration, the classic route 
for delivering recombinants subunits and toxoids to humans, 
HMW protein subunit is given subcutaneous ly to unanethesized 
mice. For oral immunization, animals are withdrawn from 

10 rations 4 hours before dosing. HMW protein is administered 
intragastrically to unanesthetized mice. Intragastrically 
vaccinated mice are returned to solid rations approximately 
3-4 hours after immunization. Mice to be vaccinated nasally 
. are sedated lightly, placed on their backs, and administered 

15 with HMW protein. 

Determination of Serum and Mucosal Antibo dy Levels: 
Beginning immediately after the first immunization and 
continuing at 7 day intervals thereafter, animals from. each 
vaccination group are anesthetized, the abdominal cavity 

20 opened and the animal exsanguinated by cardiac puncture. 

Immediately thereafter, the lower reproductive tract (cervix 
and vagina) and small intestine are surgically removed. 
Mucosal secretions are collected from the intestine and 
cervix/vagina by gently scrapping prewashed and dissected 
. 25 organs with a sterile scalpel blade. Sera and mucosal 

secretions are stored in PBS at -7 0°C until the end of the 
experiment and analyzed as a group. . 

Chlamydial IgG and secretory IgA levels in serum 
and mucosal secretions are determined by ELISA. Titers to 

30 both whole EB lysates and HMW protein are determined. 

Briefly, intact purified C .trachomatis 1^ EBs or HMW protein 
is diluted in 0.05 M sodium carbonate buffer and used to coat 
Immulon-3 (DynaTech) 96 well microtiter plates. After 
blocking with 1% BSA/PBS/0 . 05% Tween-20 and extensive washing 

35 (3X; PBS/0.05% Tween-20) serum or mucosal secretion samples, 
serially diluted in PBS, are added and the plate incubated at 
37*'C for 1 hour. All samples are tested in duplicate. 
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Unbound material is removed by washing. Affinity-purified 
HRP- conjugated to either goat anti-mouse IgA (alpha chain) 
or goat anti-mouse IgG (Vector Labs), diluted 1/5,000 in PBS, 
is then added and the plate reincubated at 37 °C for 1 hour, 
5 Secondary antibody is removed, the plate washed again and 
substrate (TMB) added. 

The color change is measured in a microplate 
spectrophotometer at 450 nm after a 30 minute incubation at 
room temperature and quenching with H2SO4. Readings >2 SD of 

10 the mean negative control value (pooled prebleed sera, pooled 
mucosal secretions from unvaccinated animals) is defined as 
positive. Reaction specificity is monitored by preabsorbing 
the primary antibody with antigen (antibody-blocking) and the 
secondary antibody with purified mouse IgG/IgA (conjugate- 

15 blocking). Antibody titers for each data point (5 

animals/point) is presented as the geometric mean ± S.D. of 
the last positive dilution. 

C. trachomatis challenge: Two weeks after the third 
immunization, animals are challenged intravaginally , while 

20 under mild anesthesia, with a single dose of 0.1 ml 

endotoxin-free PBS containing lO^IFU of purified, pretitered 
C. trachomatis EBs. Progesterone is administered (about 2.0 
mg p^r dose, i.m.) one week prior to and the time of 
challenge to block estrous and ensure infection of mouse 

25 cervical epithelial cells with human C. trachomatis strains. 
The presence and persistence of C. trachomatis in the lower 
reproductive tract of vaccinated animals is assessed using 
both a commercial Chlamydia-specif ic ELISA (Chlamydiazyme, 
Abbott Diagnostics) and by in vitro cultivation. At 7, 14, 

30 and 21 days post-challenge, animals are sacrificed as above 
and their lower reproductive tracts (cervix/ vagina) and small 
intestine surgically removed as above. 

Tissue homogenates are prepared by macerating and 
homogenizing identical amounts of tissue in 1.0 ml SPG 

35 buffer. Clarified samples are serially diluted and tested 
for Chlainydia-specif ic antigen by commercial ELISA and used 
to infect McCoy cells grown to about 90% confluency in 24- 
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well tissue culture plates. Each dilution is assayed in 
duplicate. After a 24 hour cultivation period, infected 
monolayers are fixed with methanol and inclusion bodies 
visualized by indirect fluorescence antibody staining using 
5 an ant i-Chlamydia LPS antibody. Fluorescent inclusions are 
counted at a 4 OX magnification and the resulting titer 
expressed as the mean number of inclusions per 20 fields. 
Chlamydia IgG and sIgA levels in the serum and intestine are 
also determined for these animals as detailed above. 

10 Protection is defined as the ability to eliminate or reduce 
the level of C. trachomatis in the lower genital tract. 

To determine whether vaccination with HMW protein 
protects mice against heterotypic challenge, equivalent 
groups of mice are immunized with the HMW protein and 

15 subsequently challenged with either C. trachomatis serovar B 
or E. 

Other equivalents of the present invention may be 
readily determined by those skilled in the art and such 
equivalents are intended to be included in this invention. 

20 The foregoing disclosure includes all the information deemed 
essential to enable those skilled in the art to practice the 
claimed invention with out undue experimentation. Because 
the cited patents or publications may provide further useful 
informatibn, all the cited materials are hereby incorporated 

25 by reference herein in their entireties. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION 

(i) APPLICANT: Jackson, W. James 
Pace, John L, 

(ii) TITLE OF THE INVENTION: Chlamydia Protein, Gene Sequence 

And Uses Thereof 

(iii) NUMBER OF SEQUENCES: 37 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Pennie fit Edmonds LLP 

(B) STREET: 1155 Avenue of the Americas 

(C) CITY: New York 

(D) STATE: NY 

(E) COUNTRY: USA 

(F) ZIP: 10036-2711 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Baldwin, Geraldine F. 

(B) REGISTRATION NUMBER: 31,232 

(C) REFERENCE/DOCKET NUMBER: 7969-062 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (212) 790-9090 

(B) TELEFAX: (212) 869-8864 

(C) TELEX: 66141 PENNIE 

(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4435 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GGGCAAAACT CTTCCCCCCG GGATTTATAT GGGAAAGGGG AAACTTTGGC CCGTATTCAA 60 

GCGCCACGGG TTTTGGGGCG GAATGAATTT TTTCGTTCCG GAAAAAGTAA TTCCCCGGGA 120 

ACGTAGGGTA TCGGTTTCAT AGGCTCGCCA AATGGGATAT AGGTGGAAAG GTAAAAAAAA 180 

CTGAGCCAAG CAAAGGATAG AGAAGTCTTG TAATCATCGC AGGTTAAAGG GGGGATGTTA 240 

TTTTAGCCTG CAAATAGTGT AATTATTGGA TCCTGTAAAG AGAAAAGGAC GAATGCGCTG 300 

AAGATAAGAA CATTTATTGA TATTAAATTA TTAATTTTTT ATGAAGCGGA GTAATTAATT 360 

TTATCTCTCA GCTTTTGTGT GATGCAAACG TCTTTCCATA AGTTCTTTCT TTCAATGATT 420 

CTAGCTTATT CTTGCTGCTC TTTAAATGGG GGGGGATATG CAGCAGAAAT CATGGTTCCT 480 

CAAGGAATTT ACGATGGGGA GACGTTAACT GTATCATTTC CCTATACTGT TATAGGAGAT 540 

CCGAGTGGGA CTACTGTTTT TTCTGCAGGA GAGTTAACAT TAAAAAATCT TGACAATTCT 600 

ATTGCAGCTT TGCCTTTAAG TTGTTTTGGG AACTTATTAG GGAGTTTTAC TGTTTTAGGG 660 
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AGAGGACACT 
AATAGCGCTG 
TGCAATTCAT 
ACGACAACAT 
AATAATGAGA 
GCTAAGAGCT 
CAAGCTGATG 
c CCTATTGCCT 
GATGGGCAGC 
AGAAATACTG 
TACGGGAACG 
CCTGTTTACA 
TACGGAGATG 
GGATCAGTTT 
AAAGGGGGAG 
TTAAGGAATA 
10 TTATCTGCTG 
AATGCTGCCG 
GGGAAAATAA 
GAGATGGCAA 
GGTGAAGGAT 
GTTACGATAG 
CTAAGTCAGA 
CCACAACCAC 
TTGTCTCTTT 
GCGCAAGATT 
GGGCCTATCT 
TCTAATCAAA 
CCATCAGATT 
CTTGCGTGGG 
AAAACTGGGT 
GGATCCATTT 
TCTTATTGTC 
GCTTTAGGTC 
2 0 TTTGGATCAT 
GTAGTGTGTC 
GCTTTATGTG 
AATCAGCATA 
AACTGTCTGG 
TATTTGAATG 
TTTACAGAGG 
GTTCCTGTTG 
ATGGCGGCTT 
2 5 TCCCATCAAG 
AGAGGATCTA 
GAGTATCGAG 
AAATATTGGT 
TGTTTTTTAG 
ACGCCTAATG 
TTTGTCAATG 
TTAACGATTA 
CAAAATTATG 
3 0 ATGTTCTCGA 
ATTTCCGGAG 
ACTGTGCCAA 
TCTATTTTTT 
GATAATAATG 
GGTGGAGGCA 
AGGGAAAGTT 
CATTGTGCTT 
TGATTTAAGG 
35 AGGAGTTATT 
TTTTGATTAC 



CGTTGACTTT 
CTGATGGACT 
TACTTGCCGT 
CTACACCGTC 
AGTTCTCATT 
TAACGGTTCA 
GGGGAGCTTG 
TTGTAGCGAA 
AGGGAGTGTC 
CGGTAGAGTT 
TTGCTTTCCT 
TTGCTGCTAA 
GAGGAGCTAT 
CCTTTGATGG 
CTATTTATGC 
TCGCTAATGA 
ATTATGGAGA 
ATGTTAATGG 
CGACATTAAG 
ACGGAAATAA 
ACACAGGGGA 
AGCAAGGAAG 
CAGGTGGGAG 
CACAACAGCC 
CTTCTTTGTT 
CTCATCCTGC 
TTTTTGAGGA 
AAATCAATGT 
TGACTCTAGG 
ATCCTAATAC 
ATAATCCTGG 
TAGATATACG 
GAGGATTATG 
AGGGATATCG 
CGATGTTTGG 
GTTCCAATCA 
GATCCTATTT 
TGAAAACCTC 
CTGGAGAGAT 
AGTTGCGTCC 
AAGGCGATCA 
GAGTGAAGTT 
ATATCTGTGA 
AGACATGGAC 
TGTATGCTTC 
ATGCTTCTCG 
TAGATAGTTA 
CTTGTTTGTG 
TAACCACTCC 
TCTATGCAGG 
CCGGACAAAA 
CCTTCATTTC 
AAAATGTTTC 
CAGGCGAAGT 
CCTCATCATC 
CTGTAGAGAC 
CCGGGAATTT 
GTGGGTTCCG 
TCTTTCACAG 
TTCAAAGACA 
ATTCTTGCTG 
TGCTCTCCAG 
AACTTTGCAA 



CGAGAACATA 
GTTTACTATT 
ACTGCCTGCT 
TAATGGTACT 
CTATAGTAAT 
AGGAATTAGC 
TCAAGTAGTC 
TGTTGCAGGA 
ATCATCTACT 
TGATGGGAAC 
GAATAATGGA 
GCAACCAACA 
CTTCTGTAAG 
AGAGGGAGTA 
CAAAAAGCTC 
TGGTGGAGCG 
TATTATTTTC 
CGTAACTGTG 
AGCTAAAGCA 
CCAGCCAGCG 
TATTGTTTTT 
GATTGTTCTT 
TCTGTATATG 
TCCTGCCGCT 
AGCAAACAAT 
AGTCATTGGT 
TTTGGATGAT 
CCTGAAATTA 
GAATGAGATG 
AGCAAATAAT 
GCCTGAGCGA 
ATCTGCGCAt 
GGTTTCTGGA 
GTATATTAGT 
TCTAGCATTT 
TCATGCXTGC 
GTTCGGAGAT 
ATATACATTT 
TGGAGCGGGA 
TTTCGTGCAA 
AGCTCGGGCA 
TGATCGATGT 
TGCTTATCGC 
AACAGATGCC 
TCTAACAAGT 
AGGCTATGGT 
AGTGTTAGCG 
TTCCTATTCG , 
TTTTAAGGGA 
AGCTGAAGAA 
CCATACATTA 
AGCAGGAGAG 
TTGCGGAGAA 
GATTTTCTGG 
AACTCCGCCT 
TAGTTTGGAG 
CGGAACAGTT 
CTACACCATC 
ATAACGTAGC 
ATGAAGGAGG 
CTACTAATCA 
ATGATTCTGT 
AAGGCAGAGG 



CGGACTTCTA 
GAGGGTTTTA 
GCAACGACTA 
ATTTATTCTA 
TTAGTCTCTG 
AAGCTTTGTG 
ACCAGTTTCT 
GTAAGAGGGG 
TCAACAGAAG 
GTAGCCCGAG 
AAAACCTTGT 
AGTGGACAGG 
AATGGTGCGC 
GTTTTCTTTA 
TCGGTTGCTA 
ATTTATTTAG 
GATGGGAATC 
TCCTCACAAG 
GGGCATCAGA 
CAGTCTTCCA 
GCTAATGGAA 
CGTGAAAAGG 
GAAGCTGGGA 
AATCAGTTGA 
GCAGTTACGA 
AGCACAACTG 
ACAGCTTATG 
CAGTTAGGGA 
CCTAAGTATG 
GGTCCTTATA 
GTAGCTTCTT 
TCAGCAATTC 
GTTTCGAATT 
GGGGGTTATT 
ACCGAAGTAT 
ATAGGATCCG 
GCGTTTATCC 
GCAGAGGAGA 
TTACCGATTG 
GCTGAGTTTT 
TTCAAGAGCG 
TCTAGTACAC 
ACCATCTCTG 
TTTCATTTAG 
AATATAGAAG 
TTGAGTGCAG 
ATGCCTTTTT 
TATGGATTCG 
GACGATGTTT 
GGTTCGATTA 
TCATTTACAG 
ACACTTACTC 
AAGGGAATGA 
GATAACTCCG 
GCTCCAACAG 
ATCTCAGGCG 
TTTCGAGGTA 
AAGTACGACT 
CTCTTGCGGA 
CATATTCTTC 
GGATCAGAAT 
AAAGTTTGAA 
CGGAAGCATC 



CAAATGGGGC 
AAGAATTATC 
ATAAGGGTAG 
AAACAGATCT 
GAGATGGGGG 
TCTTCCAAGA 
CTGCTATGGC 
GAGGGATTGC 
ATCCAGTAGT 
TAGGAGGAGG 
TTCTCAACAA 
CTTCTAATAC 
AAGCAGGATC 
GTAGCAATGT 
ACTGTGGCCC 
GAGAATCTGG 
TTAAAAGAAC 
CCATTTCGAT 
TTCTCTTTAA 
AACTTCTAAA 
GCAGTACTTT 
CAAAATTATG 
GTACATGGGA 
TCACGCTTTC 
ATCCTCCTAC 
CTGGTTCTGT 
ATAGGTATGA 
CTAAGCCCCC 
GCTATCAAGG 
CTCTGAAAGC 
TGGTTCCAAA 
AAGCAAGTGT 
TCTTCTATCA 
CCTTAGGAGC 
TTGGTAGATC 
TTTATCTATC 
GTGCTAGCTA 
GCGATGTTCG 
TGATTACTCC 
CTTATGCCGA 
GACATCTCCT 
ATCCTAATAA 
GTACTGAGAC 
CAAGACATGG 
TATATGGCCA 
GAAGTAGAGT 
CTTTGAGATC 
CGAGCTCTCC 
ACTTGAATGG 
TCTCAGCTAA 
ATTCTCAAGG 
TGAGAGATTT 
TCTCCGGGAA 
TGGGGTATTC 
TTAGTGATGC 
TCAAAAAAGG 
AGAATAATAA 
TTTACAGTTA 
GGCGGAGTGG 
CGAGGGAACA 
ACGGAGACAG 
GGCAATAAAG 
CTAACGAAAG 



AGCTCTAAGT 
CTTTTCCAAT 
CCAGACTCCG 
TTTGTTACTC 
AGCTATAGAT 
AAATACTGCT 
TAACGAGGCT 
TGCTGTTCAG 
AAGTTTTTCC 
GATTTACTCC 
TGTTGCTTCT 
GAGTAATAAT 
CAATAACTCT 
AGCTGCTGGG 
TGTACAATTT 
AGAGCTCAGT 
AGCCAAAGAG 
GGGATCGGGA 
TGATCCCATC 
AATTAACGAT 
GTACCAAAAT 
AGTGAATTCT 
TTTTGTAACT 
CAATCTGCAT 
CAATCCTCCA 
TACAATTAGT 
TTGGCTAGGT 
AGCTAATGCC 
AAGCTGGAAG 
TACATGGACT 
TAGTTTATGG 
GGATGGGCGC 
TGACCGCGAT 
AAACTCCTAC 
TAAAGATTAT 
TACCCAACAA 
CGGGTTTGGG 
TTGGGAT AAT 
ATCTAAGCTC 
TCATGAATCT 
AAATCTATCA 
ATATAGCTTT 
AACGCTCCTA 
AGTTGTGGTT 
TGGAAGATAT 
CCGGTTCTAA 
TACATCATTT 
TCAAGTGTTA 
AGACTGCGCT 
TGGCGACAAT 
GCCAGTTCTT 
TTCGAGTCTG 
AACCGTGAGT 
TCCTTTATCT 
TCGGAAAGGG 
GGTCATGTTC 
TAATGCTGGT 
AAAACTGTAA 
TTTATAAAGG 
CAGCATACGA 
GAGGCGGTGG 
GTTCTATTGT 
AATTC 



720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4435 



(2) INFORMATION FOR SEQ ID NO: 2: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1012 amino acids 

(B) TYPE: £imino acid 

(C) STRANDEDNESS: 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Gin Thr Ser Phe His Lys Phe Phe Leu Ser Met lie Leu Ala Tyr 

1 5 10 15 

Ser Cys Cys Ser Leu Asn Gly Gly Gly Tyr Ala Ala Glu lie Met Val 

20 25 30 

Pro Gin Gly He Tyr Asp Gly Glu Thr Leu Thr Val Ser Phe Pro Tyr 
35 40 45 

10 Thr Val He Gly Asp Pro Ser Gly Thr Thr Val Phe Ser Ala Gly Glu 
50 55 60 

Leu Thr Leu Lys Asn Leu Asp Asn Ser He Ala Ala Leu Pro Leu Ser 
65 70 75 80 

Cys Phe Gly Asn Leu Leu Gly Ser Phe Thr Val. Leu Gly Arg Gly His 

^85 90 95 

Ser Leu Thr Phe Glu Asn He Arg Thr Ser Thr Asn Gly Ala Ala Leu 

100 105 110 

Ser Asn Ser Ala Ala Asp Gly Leu Phe Thr He Glu Gly Phe Lys Glu 

115 120 125 

Leu Ser Phe Ser Asn Cys Asn Ser Leu Leu Ala Val Leu Pro Ala Ala 

130 135 140 

Thr Thr Asn Lys Gly Ser Gin Thr Pro Thr Thr Thr Ser Thr Pro Ser 
145 150 155 160 

Asn Gly Thr He Tyr Ser Lys Thr Asp Leu Leu Leu Leu Asn Asn Glu 

165 170 175 

Lys Phe Ser Phe Tyr Ser Asn Leu Val Ser Gly Asp Gly Gly Ala He 
180 185 190 

2 0 Asp Ala Lys Ser Leu Thr Val Gin Gly He Ser Lys Leu Cys Val Phe 
195 200 205 

Gin Glu Asn Thr Ala Gin Ala Asp Gly Gly Ala Cys Gin Val Val Thr 

210 215 220 

Ser Phe Ser Ala Met Ala Asn Glu Ala Pro He Ala Phe Val Ala Asn 
225 .230 235 240 

Val Ala Gly Val Arg Gly Gly Gly He Ala Ala Val Gin Asp Gly Gin 

245 250 255 

Gin Gly val Ser Ser Ser Thr Ser Thr Glu Asp Pro Val Val Ser Phe 

260 265 270 

Ser Arg Asn Thr Ala Val Glu Phe Asp Gly Asn Val Ala Arg Val Gly 

275 280 285 

Gly Gly He Tyr Ser Tyr Gly Asn Val Ala Phe Leu Asn Asn Gly Lys 

290 295 300 

Thr Leu Phe Leu Asn Asn Val Ala Ser Pro Val Tyr He Ala Ala Lys 
305 310 315 320 

Gin Pro Thr Ser Gly Gin Ala Ser Asn Thr Ser Asn Asn Tyr Gly Asp 

325 330 335 

Gly Gly Ala He Phe Cys Lys Asn Gly Ala Gin Ala Gly Ser Asn Asn 

340 345 350 

Ser Gly Ser Val Ser Phe Asp Gly Glu Gly Val Val Phe Phe Ser Ser 

355 360 365 

Asn Val Ala Ala Gly Lys Gly Gly Ala He Tyr Ala Lys Lys Leu Ser 

370 375 380, 

Val Ala Asn Cys Gly Pro Val Gin Phe Leu Arg Asn He Ala Asn Asp 
385 390 395 400 

Gly Gly Ala He Tyr Leu Gly Glu Ser Gly Glu Leu Ser Leu Ser Ala 

405 410 415 

Asp Tyr Gly Asp He He Phe Asp Gly Asn Leu Lys Arg Thr Ala Lys 

420 425 430 

Glu Asn Ala Ala Asp Val Asn Gly Val Thr Val Ser Ser Gin Ala He 
435 .440 445 



25 



30 



35 
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Ser Met Gly Ser Oly Gly Lys lie Thr Thr Leu Arg Ala Lys Ala Gly 

His Gin II Leu Phe Asn Asp Pro He Glu Met Ala Asn Gly Asn Asn 

470 ff/o 
Gin pro Ala Gin Ser Ser Lys Leu Leu Lys He Asn Asp Gly Glu Gly 

485 490 
Tyr Thr Gly Asp He Val Phe Ala Asn Gly Ser Ser Thr Leu Tyr Gin 

500 505 
Asn val Thr He Glu Gin Gly Arg He Val Leu Arg Glu Lys Ala Lys 

C15 520 o^o 

Leu ser l\\ Asn Ser Leu Ser Gin Thr Gly Gly Ser Leu Tyr Met Glu 

con 535 
Ala Gly ser Thr Trp Asp Phe Val Thr Pro Gin Pro Pro Gin Gin Pro 

c>1C 550 

p" Ala Ala Asn Gin Leu He Thr Leu Ser Asn Leu His Leu Ser Leu 

565 5*^0 
ser ser Leu Leu Ala Asn Asn Ala Val Thr Asn Pro Pro Thr Asn Pro 

580 585 
pro Ala Gin Asp Ser His Pro Ala Val He Gly Ser Thr Thr Ala Gly 

595 600 605 

ser val Thr He Ser Gly Pro He Phe Phe Glu Asp Leu Asp Asp Thr 

filO 615 
Ala Tyr Asp Arg Tyr Asp Trp Leu Gly Ser Asn Gin Lys He Asn Val 

630 635 
Leu Lys Leu Gin Leu Gly Thr Lys Pro Pro Ala Asn Ala Pro Ser Asp 
645 

Leu Thr Leu Gly Asn Glu Met Pro Lys Tyr Gly Tyr Gin Gly Ser Trp 

Pro 

675 ' ' 680 685 



Lys Leu Ala ^rp Asp Pro Asn Thr Ala Asn Asn Gly Pro Tyr Thr Leu 

675 680 
Lys Ala Thr Trp Thr Lys Thr Gly Tyr Asn Pro Gly Pro Glu Arg Val 

Ser He Leu Asp He Arg 
715 720 



Ala le? Leu Val Pro Asn ler Leu Trp Gly Ser He Leu Asp He Arg 



ser Ala His Ser Ala He Gin Ala Ser Val A^p Gly Arg Ser Tyr Cys 

725 730 '■'^ 

Arg Gly Leu Trp Val Ser Gly Val Ser Asn Phe Phe Tyr His Asp Arg 

ASP Ala Leu gI? Gin Gly Tyr Arg J Jr He Ser Gly Gly Tyr Ser Leu 

755 760 '^^ , 

Gly Ala Asn Ser Tyr Phe Gly Ser Ser Met Phe Gly Leu Ala Phe Thr 

77n '775 
Glu Vai Phe Gly Arg Ser Lys Asp Tyr Val Val Cys Arg Ser Asn H.s 



His Ala cys He Gly Ser Val Tyr Leu Ser Thr Gin Gin Ala Leu Cys 

8 10 



785 790 795 

His 

Gly ser Tyr Leu Phe Gly Asp Ala Phe He Arg Ala Ser Tyr Gly Phe 

820 825 
Gly Asn Gin His Met Lys Thr Ser Tyr Thr Phe Ala Glu Glu Ser Asp 

Q-jc . 840 
val Arg Trp Asp Asn Asn Cys Leu Ala Gly Glu He Gly Ala Gly Leu 
850 855 860 



Phe val Gin Ala Glu IJe Ser Tyr Ala Asp Sis Glu Ser Phe Thr Glu 



Pro He val He Thr Pro Ser Lys Leu Tyr Leu Asn Glu Leu Arg Pro 

885 8*90 "~ 895 

Glu Gly Asp Gin Ala Arg Ala Phe Lys Ser Gly His Leu Leu Asn Leu 

900 905 
ser val Pro Val Gly Val Lys Phe Asp Arg Cys Ser Ser Thr His Pro 

Q15 920 
Asn Lys Tyr Ser Phe Met Ala Ala Tyr He Cys Asp Ala Tyr Arg Thr 
930 935 940 



He ser Gly Thr Glu Thr Thr Leu Leu Ser His Gin Glu Thr Trp Thr 
?Sr ASP Ala Phe His LeS Ala Arg His Gly Jal Val Val Arg Gly Ser 



Met Tyr Ala Ser Leu Thr Ser Asn lie Glu Val Tyr Gly His Gly Arg 

980 985 990 

Tyr Glu Tyr Arg Asp Ala Ser Arg Gly Tyr Gly Leu Ser Ala Gly Ser 

995 1000 1005 

Arg Val Arg Phe 
1010 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Glu He Met Val Pro Gin Gly He Tyr Asp Gly Glu Thr Leu Thr Val 

1 5 10 15 

Ser Phe Xaa Tyr 
20 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GAAATHATGG TNCCNCAA 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GAAATHATGG TNCCNCAG 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GAGATHATGG TNCCNCAA 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GAGATHATGG TNCCNCAG 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



NGTYTCNCCR TCATA 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
NGTYTCNCCR TCGTA 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1511 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D> TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: 

GAAATCATGG TTCCTCAAGG AATTTACGAT 
ACTGTTATAG GAGATCCGAG TGGGACTACT 
AATCTTGACA ATTCTATTGC AGCTTTGCCT 
TTTACTGTTT TAGGGAGAGG ACACTCGTTG 
GGGGCAGCTC TAAGTAATAG CGCTGCTGAT 
TTATCCTTTT CCTUVTTGCAA TTCATTACTT 
GGTAGCCAGA CTCCGACGAC AACATCTACA 
GATCTTTTGT TACTCAATAA TGAGAAGTTC 
GGGGGAGCTA TAGATGCTAA GAGCTTAACG 
CAAGAAAATA CTGCTCAAGC TGATGGGGGA 
ATGGCTAACG AGGCTCCTAT TGCCTTTGTA 
ATTGCTGCTG TTCAGGATGG GCAGCAGGGA 



SEQ ID NO: 10: 

GGGGAGACGT TAACTGTATC ATTTCCCTAT 
Q,j,rj,fj.rpr|,rj.cTG CAGGAGAGTT AACATTAAAA 
TTAAGTTGTT TTGGGAACTT ATTAGGGAGT 
ACTTTC6AGA ACATACGGAC TTCTACAAAT 
GCACTGTTTA CTATTGAGGG TTTTAAAGAA 
GCCGTACTGC CTGCTGCAAC GACTAATAAG 
CCGTCTAATG GTACTATTTA TTCTAAAACA 
TCATTCTATA GTAATTTAGT CTCTGGAGAT 
GTTCAAGGAA TTAGCAAGCT TTGTGTCTTC 
GCTTGTCAAG TAGTCACCAG TTTCTCTGCT 
GCGAATGTTG CAGGAGTAAG AGGGGGAGGG 
GTGTCATCAT CTACTTCAAC AGAAGATCCA 



GTAGTAAGTT 
GGAGGGATTT 
AACAATGTTG 
AATACGAGTA 
GGATCCAATA 
AATGTAGCTG 
GGCCCTGTAC 
5 TCTGGAGAGC 
AGAACAGCCA 
TCGATGGGAT 
TTTAATGATC 
CTAAAAATTA 
ACTTTGTACC 
TTATCAGTGA 



10 



15 



30 



35 



TTTCCAGAAA 
ACTCCTACGG 
CTTCTCCTGT 
ATAATTACGG 
ACTCTGGATC 
CTGGGAAAGG 
AATTTTTAAG 
TCAGTTTATC 
AAGAGAATGC 
CGGGAGGGAA 
CCATCGAGAT 
ACGATGGTGA 
AAAATGTTAC 
A 



TACTGCGGTA 
GAACGTTGCT 
TTACATTGCT 
AGATGGAGGA 
AGTTTCCTTT 
GGGAGCTATT 
GAATATCGCT 
TGCTGATTAT 
TGCCGATGTT 
AATAACGACA 
GGCAAACGGA 
AGGATACACA 
GATAGAGCAA 



GAGTTTGATG 
TTCCTGAATA 
GCTAAGCAAC 
GCTATCTTCT 
GATGGAGAGG 
TATGCCAAAA 
AATGATGGTG 
GGAGATATTA 
AATGGCGTAA 
TTAAGAGCTA 
AATAACCAGC 
GGGGATATTG 
GGAAGGATTG 



GGAACGTAGC 
ATGGAAAAAC 
CAACAAGTGG 
GTAAGAATGG 
GAGTAGTTTT 
AGCTCTCGGT 
GAGCGATTTA 
TTTTCGATGG 
CTGTGTCCTC 
AAGCAGGGCA 
CAGCGCAGTC 
TTTTTGCTAA 
TTCTTCGTGA 



CCGAGTAGGA 
CTTGTTTCTC 
ACAGGCTTCT 
TGCGCAAGCA 
CTTTAGTAGC 
TGCTAACTGT 
TTTAGGAGTVA 
GAATCTTAAA 
ACAAGCCATT 
TCAGATTCTC 
TTCCAAACTT 
TGGAAGCAGT 
AAAGGCAAAA 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1444 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



TTCTCTAAGT 
AACTCCACAA 
GCATTTGTCT 
TCCAGCGCAA 
TAGTGGGCCT 
AGGTTCTAAT 
TGCCCCATCA 

2 0 GAAGCTTGCG 
GACTAAAACT 
ATGGGGATCC 
GCGCTCTTAT 
CGATGCTTTA 
CTACTTTGGA 
TTATGTAGTG 
ACAAGCTTTA 
TGGGAATCAG 

2 5 TAATAACTGT 
GCTCTATTTG 
ATCTTTTACA 
ATCAGTTCCT 
CTTTATGGCG 
CCTATCCCAT 
GGTTAGAGGA 
ATATGAGTAT 
CTAA 



CAGACAGGTG 
CCACCACAAC 
CTTTCTTCTT 
GATTCTCATC 
ATCTTTTTTG 
CAAAAAATCA 
GATTTGACTC 
TGGGATCCTA 
GGGTATAATC 
ATTTTAGATA 
TGTCGAGGAT 
GGTCAGGGAT 
TCATCGATGT 
TGTCGTTCCA 
TGTGGATCCT 
CATATGAAAA 
CTGGCTGGAG 
AATGAGTTGC 
GAGGAAGGCG 
GTTGGAGTGA 
GCTTATATCT 
CAAGAGACAT 
TCTATGTATG 
CGAGATGCTT 



GGAGTCTGTA 
AGCCTCCTGC 
TGTTAGCAAA 
CTGCAGTCAT 
AGGATTTGGA 
ATGTCCTGAA 
TAGGGAATGA 
ATACAGCAAA 
CTGGGCCTGA 
TACGATCTGC 
TATGGGTTTC 
ATCGGTATAT 
TTGGTCTAGC 
ATCATCATGC 
ATTTGTTCGG 
CCTCATATAC 
AGATTGGAGC 
GTCCTTTCGT 
ATCAAGCTCG 
AGTTTGATCG 
GTGATGCTTA 
GGACAAGAGA 
CTTCTCTAAC 
CTCGAGGCTA 



TATGGAAGCT 
CGCTAATCAG 
CAATGCAGTT 
TGGTAGCACA 
TGATACAGCT 
ATTACAGTTA 
GATGCCTAAG 
TAATGGTCCT 
GCGAGTAGCT 
GCATTCAGCA 
TGGAGTTTCG 
TAGTGGGGGT 
ATTTACCGAA 
TTGCATAGGA 
AGATGCGTTT 
ATTTGCAGAG 
GGGATTACCG 
GCAAGCTGAG 
GGCATTCAAG 
ATGTTCTAGT 
TCGCACCATC 
TGCCTTTCAT 
AAGTAATATA 
TGGTTTGAGT 



GGGAGTACAT 
TTGATCACGC 
ACGAATCCTC 
ACTGCTGGTT 
TATGATAGGT 
GGGACTAAGC 
TATGGCTATC 
TATACTCTGA 
TCTTTGGTTC 
ATTCAAGCAA 
AATTTCTTCT 
TATTCCTTAG 
GTATTTGGTA 
TCCGTTTATC 
ATCCGTGCTA 
GAGAGCGATG 
ATTGTGATTA 
TTTTCTTATG 
AGCGGACATC 
ACACATCCTA 
TCTGGTACTG 
TTAGCAAGAC 
GAAGTATATG 
GCAGGAAGTA 



GGGATTTTGT 

TTTCCAATCT 

CTACCAATCC 

CTGTTACAAT 

ATGATTGGCT 

CCCCAGCTAA 

AAGGAAGCTG 

AAGCTACATG 

CAAATAGTTT 

GTGTGGATGG 

ATCATGACCG 

GAGCAAACTC 

GATCTAAAGA 

TATCTACCCA 

GCTACGGGTT 

TTCGTTGGGA 

CTCCATCTAA 

CCGATCATGA 

TCCTAAATCT 

ATAAATATAG 

AGACAACGCT 

ATGGAGTTGT. 

GCCATGGAAG 

GAGTCCGGTT 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
AAGGGCCCAA TTACG.CAGAG GGTACCGAAA TTATGGTTCC TCAAGGAATT TACGAT 



780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1511 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1444 



56 
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r 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

^ (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
AAGGGCCCAA TTACGCAGAG GGTACCCTAA GAAGAAGGCA TGCCGTGCTA GCGGAG 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
AAGGGCCCAA TTACGCAGAG GGTACCGGAG AGCTCGCGAA TCCATACGAA TAGGAAC 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1013 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Met Gin Thr Ser Phe His Lys Phe Phe Leu Ser Met He Leu Ala Tyr 

Ser Cys Cys Ser Leu Asn Gly Gly Gly Tyr Ala Ala Glu lie Met Val 

" pro Gin Gly Tyr Asp Gly Glu Thr Leu Thr Val Ser Phe Pro Tyr 

— J- 40 
Thr val lie Gly Asp Pro Ser Gly Thr Thr Val Phe Ser Ala Gly Glu 

Lys 
Asn 
Phe 

ser ASP ser a!" Asn Ser Gly Leu Phe Thr He Glu Gly Phe Lys Glu 

lit; 120 xz D 

Leu ser Ihl Ser Asn Cys Asn Pro Leu Leu Ala Val Leu Pro Ala Ala 

- _ 135 14U 

Thr ihr Asn Asn Gly Ser Gin Thr Pro Ser Thr Thr Ser Thr Pro Ser 

III Gly Thr lie Tyr Ser Lys Thr Asp Leu Leu Leu Leu Asn Asn Glu 

" Lys Phe ser Phe iyr Ser Asn Ser Val Ser Gly Asp Gly Gly Ala He 

ASP Ala Lys Ser Leu Thr Val Gin G^y He Ser Lys Leu Cys Val Phe 
195 200 

- 15 - 



Leu tL Leu Lys Asn Leu Asp Asn Ser He Ala Aia Leu Pro Leu Ser 
- 7Q / o . 

cys Phe Gly Asn Leu Leu Gly Ser Phe Thr Val Leu Gly Arg Gly Hxs 

30 ser Leu Thr Phe Glu Asn He Arg Thr Lr Thr Asn Gly Ala Ala Leu 

inn 105 ^-^^ 



56 



57 



PENY4-531696.1 



Gin Glu Asn Thr Ala Gin Ala Asp Gly Gly Ala Cye Gin Val Val Thr 

210 215 220 

ser Phe Ser Ala Met Ala Asn Glu Ala Pro He Ala Phe Val Ala Asn 
225 230 235 240 

Val Ala Gly Val Arg Gly Gly Gly He Ala Ala Val Gin Asp Gly Gin 

245 250 255 

Gin Gly Val Ser Ser Ser Thr Ser Thr Glu Asp Pro Val Val Ser Phe 
^ 260 265 270 

Ser Arg Asn Thr Ala Val Glu Phe Asp Gly Asn Val Ala Arg Val Gly 

275 280 285 

Glv Gly He Tyr Ser Tyr Gly Asn Val Ala Phe Leu Asn Asn Gly Lys 

290 295 300 

Thr Leu Phe Leu Asn Asn Val Ala Ser Pro Val Tyr He Ala Ala Glu 
305 310 315 320 

Gin Pro Thr Asn Gly Gin Ala Ser Asn Thr Ser Asp Asn Tyr Gly Asp 
325 330 335 

1 n Gly Gly Ala He Phe Cys Lys Asn Gly Ala Gin Ala Ala Gly Ser Asn 
340 345 350 

Asn Ser Gly Ser Val Ser Phe Asp Gly Glu Gly Val Val Phe Phe Ser 

355 360 365 

Ser Asn Val Ala Ala Gly Lys Gly Gly Ala He Tyr Ala Lys Lys Leu 

370 375 380 

Ser Val Ala Asn Cys Gly Pro Val Gin Leu Leu Gly Asn He Ala Asn 
385 390 395 400 

Asp Gly Gly Ala He Tyr Leu Gly Glu Ser Gly Glu Leu Ser Leu Ser 
- f J J 415 

Ala Asp Tyr Gly Asp Met He Phe Asp Gly Asn Leu Lys Arg Thr Ala 

420 425 430 

Lys Glu Asn Ala Ala Asp Val Asn Gly Val Thr Val Ser Ser Gin Ala 

435 440 445 

He Ser Met Gly Ser Gly Gly Lys He Thr Thr Leu Arg Ala Lys Ala 

450 455 460 

Gly His Gin He Leu Phe Asn Asp Pro He Glu Met Ala Asn Gly Asn 
465 470 475 480 

on Asn Gin Pro Ala Gin Ser Ser Glu Pro Leu Lys He Asn Asp Gly Glu 

485 490 495 

Glv Tyr Thr Gly Asp He Val Phe Ala Asn Gly Asn Ser Thr Leu Tyr 

500 505 510 

Gin Asn Val Thr He Glu Gin Gly Arg He Val Leu Arg Glu Lys Ala 

515 520 525 

Lys Leu Ser Val Asn Ser Leu Ser Gin Thr Gly Gly Ser Leu Tyr Met 

530 535 540 , 

Glu Ala Gly ser Thr Leu Asp Phe Val Thr Pro Gin Pro Pro Gin Gin 
05 545 550 555 560 

Pro Pro Ala Ala Asn Gin Ser He Thr Leu Ser Asn Leu His Leu Ser 

565 570 575 

Leu Ser Ser Leu Leu Ala Asn Asn Ala Val Thr Asn Pro Pro Thr Asn 

580 585 590 

Pro Pro Ala Gin Asp Ser His Pro Ala Val He Gly Ser Thr Thr Ala 

595 600 605 

Glv ser Val Thr He Ser Gly Pro He Phe Phe Glu Asp Leu Asp Asp 
610 615 620 

OA Thr Ala Tyr Asp Arg Tyr Asp Trp Leu Gly Ser Asn Gin Lys He Asp 
625 630 635 o4U 

Val Leu Lys Leu Gin Leu Gly Thr Gin Pro Pro Ala Asn Ala Pro Ser 

645 650 655 

ASP Leu Thr Leu Gly Asn Glu Met Pro Lys Tyr Gly Tyr Gin Gly Ser 

660 665 670 

Trp Lys Leu Ala Trp Asp Pro Asn Thr Ala Asri Asn Gly Pro Tyr Thr 

675 680 685 

Leu Lys Ala Thr Trp Thr Lys Thr Gly Tyr Asn Pro Gly Pro Glu Arg 
,5 690 695 700 

Val Ala Ser Leu Val Pro Asn Ser Leu Trp Gly Ser He Leu Asp He 
705 710 715 720 

Arg ser Ala His Ser Ala He Gin Ala Ser Val Asp Gly Arg Ser Tyr 
• 725 730 735 
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Cys Arg Gly Leu Trp Val Ser Gly Val Ser Asn Phe Phe Tyr His Aap 
740 745 



Leu Gly Ala Asn Ser Tyr Phe Gly Ser Ser Met Phe Gly Leu Ala Phe 

780 



740 — « 

Arg ABp Ala Leu Gly Gin Gly Tyr Arg Tyr He Ser Gly Gly Tyr Ser 
755 760 
Ala Asn Ser Tyr Phe 

Thr gIS val Phe Gly Arg sir Lys Asp Tyr Val vll Cys Arg Ser Asn 
70C 790 795 

5 His His Ala cys He Gly Ser Val Tyr Leu Ser Thr Lys Gin Ala Leu 

805 810 '^^^ 

cys Gly Ser Tyr Val Phe Gly Asp Ala Phe He Arg Ala Ser Tyr Gly 

820 825 830 

Phe Gly Asn Gin His Met Lys Thr Ser Tyr Thr Phe Ala Glu Glu Ser 

835 840 
ASP val cys Trp Asp Asn Asn Cys Leu Val Gly Glu He Gly Val Gly 
ftcn 855 860 

10 Leu Pro He Val He Thr Pro Ser Lys Leu Tyr Leu Asn Glu Leu Arg 

III Phe val Gin Ala gIS Phe Ser Tyr Ala Asp His Glu Ser Phe Thr 

885 890 
Glu Glu Gly Asp Gin Ala Arg Ala Phe Arg Ser Gly His Leu Met Asn 

900 ^-^^ 
Leu ser Val Pro Val Gly Val Lys Phe Asp Arg Cys Ser Ser Thr Hxs 
915 920 
Asn Lys Tyr Ser Phe Met Gly Ala Tyr He 
930 935 

He Ser Gly Thr Gin Thr Thr Leu Leu Ser 
QAc 950 555 

Thr Thr Asp Ala Phe His Leu Ala Arg His Gly Val He Val Arg Gly 



915 920 525 

Lys 

935 

Thr He Ser Gly Thr Gin Thr Thr Leu Leu Ser His Gin Glu Thr Trp 



915 920 
Pro Asn Lys Tyr Ser Phe Met Gly Ala Tyr He Cys Asp Ala Tyr Arg 



965 570 

ser Met Tyr Ala Ser Leu Thr Ser Asn He Glu Val Tyr Gly His Gly 

980 585 
Arg Tyr Glu Tyr Arg Asp Thr Ser Arg Gly Tyr Gly Leu Ser Ala Gly 
995 1000 1005 

2 0 Ser Lys Val Arg Phe 
1010 

(2) INFORMATION FOR SEQ ID N0:16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1013 amino acids 

(B) TYPE': amino acid 

(C) STRANDEDNESS: 

( D ) TOPOLOGY : unknown 



25 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



30 



Met 


Gin 


Thr 


Ser 


Phe 
5 


His Lys 


Phe 


Phe 


1 
Ser 


Cys 


Cys 


Ser 


Leu 


Thr Gly 


Gly Gly 






20 








25 


Pro 


Gin Gly 


He 


Tyr Asp Gly 


Glu 


Thr 






35 








40 




Thr 


Val 


He 


Gly 


Asp 


Pro Ser 


Gly 


Thr 




50 








55 




Ser 


Leu 


Thr 


Leu 


Lys 


Asn 


Leu Asp 


Asn 


65 








70 




Phe 


Cys 


Phe Gly 


Asn 


Leu 


Leu Gly 


Ser 








85 








Ser 


Leu 


Thr 


Phe 


Glu 


Asn He 


Arg 


Thr 






100 








105 


Ser 


Asp 


Ser 


Ala 


Asn 


Ser Gly 


Leu 


Phe 




115 








120 





10 15 
Tvr Ala Ala Glu He Met Val 
30 

Leu Thr Val Ser Phe Pro Tyr 
45 

Thr Val Phe Ser Ala Gly Glv 
60 

He Ala Ala Leu Pro Leu Sei 
75 80 
Thr Val Leu Gly Arg Gly Hit 
90 55 
Ser Thr Asn Gly Ala Ala Lei 
110 

Thr He Glu Gly Phe Lys Glv 
125 
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L u Ser Phe Ser Asn Cys Asn Ser Leu Leu Ala Val Leu Pro Ala Ala 

130 135 140 

Thr Thr Asn Asn Gly Ser Gin Thr Pro Thr Thr Thr Ser Thr Pro Ser 
145 150 155 160 

Asn Gly Thr He Tyr Ser Lys Thr Asp Leu Leu Leu Leu Asn Asn Glu 

165 170 1'5 

Lys Phe Ser Phe Tyr Ser Asn Leu Val Ser Gly Asp Gly Gly Thr He 
180 185 ISO , . 

* Aso Ala Lys Ser Leu Thr Val Gin Gly He Ser Lys Leu Cys Val Phe 
195 200 205 

Gin Glu Asn Thr Ala Gin Ala Asp Gly Gly Ala Cys Gin Val Val Thr 

210 215 220 

Ser Phe Ser Ala Met Ala Asn Glu Ala Pro He Ala Phe He Ala Asn 
ooc 230 235 ^7" 

val Ala Gly Val Arg Gly Gly Gly He Ala Ala Val Gin Asp Gly Gin 
245 250 , 255 

1 n Gin Gly Val Ser Ser Ser Thr Ser Thr Glu Asp Pro Val Val Ser Phe 
260 265 ^ /u 

ser Arg Asn Thr Ala Val Glu Phe Asp Gly Asn Val Ala Arg Val Gly 

275 280 285 

Glv Gly He Tyr Ser Tyr Gly Asn Val Ala Phe Leu Asn Asn Gly Lys 

290 295 300 

Thr Leu Phe Leu Asn Asn Val Ala Ser Pro Val Tyr He Ala Ala Glu 
305 310 315 -^^^ 

Gin pro Thr Asn Gly Gin Ala Ser Asn Thr Ser Asp Asn Tyr Gly Asp 

325 330 -^-^^ 

Gly Gly Ala He Phe Cys Lys Asn Gly Ala Gin Ala Ala Gly Ser Asn 

340 345 350 

Asn ser Gly Ser Val Ser Phe Asp Gly Glu Gly Val Val Phe Phe Ser 

355 360 365 

ser Asn Val Ala Ala Gly Lys Gly Gly Ala He Tyr Ala Lys Lys Leu 

370 375 
ser val Ala Asn Cys Gly Pro Val Gin Phe Leu Gly Asn He Ala Asn 

385 390 
oo Asp Gly Gly Ala He Tyr Leu Gly Glu Ser Gly Glu Leu Ser Leu Ser 

405 410 ^-^^ 

Ala Asp Tyr Gly Asp He He Phe Asp Gly Asn Leu Lys Arg Thr Ala 

420 425 
Lys Glu Asn Ala Ala Asp Val Asn Gly Val Thr Val Ser Ser Gin Ala 

435 440 445 

lie ser Met Gly Ser Gly Gly Lys He Thr Thr Leu Arg Ala Lys Ala 

450 455 460 

Gly His Gin He Leu Phe Asn Asp Pro He Glu Met Ala Asn Gly Asn 
AC.C 470 47 5 

Asn Gin Pro Ala Gin Ser Ser Glu Pro Leu Lys He Asn Asp Gly Glu 

490 



485 

Gly Tyr Thr Gly Asp He Val Phe Ala Asn Gly Asn Ser Thr Leu Tyr 

500 505 510 

Gin Asn Val Thr He Glu Gin Gly Arg He Val Leu Arg Glu Lys Ala 

515 520 
Lys Leu Ser Val Asn Ser Leu Ser Gin Thr Gly Gly Ser Leu Tyr Met 
530 535 
30 Glu Ala Gly Ser Thr Leu Asp Phe Val Thr Pro Gin Pro Pro Gin Gin 
cAc 550 555 

pro Pro Ala Ala Asn Gin Leu He Thr Leu Ser Asn Leu His Leu Ser 

565 570 
Leu ser Ser Leu Leu Ala Asn Asn Ala Val Thr Asn Pro Pro Thr Asn 

580 585 , 

pro pro Ala Gin Asp Ser His Pro Ala Val He Gly Ser Thr Thr Ala 

595 600 
Gly Pro val Thr He Ser Gly Pro Phe Phe Phe Glu Asp Leu Asp Asp 

filO 615 
Thr Ala Tyr Asp Arg Tyr Asp Trp Leu Gly Ser Asn Gin Lys He Asp 

Sal Leu Lys Leu Gin LeS Gly Thr Gin Pro Ser Ala Asn Ala Pro Ser 
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f 



r 



«p ^ Thr Le„ Gly Aen olu Met Pro :.yB Tyr Oly Tyr Gin Gly Ser 
.ys «u M". «p «P pro «. »n «" -y Pro Tyr T.r 

.y. T>,r Trp »r .y, 5^ Gly Tyr »n Pro Gly Pro Glu 
val M. s.r Leu v.l Pro III S.r Leu Trp Gly S.r He Leu «P II. 
» ser .1. HU ser il" He ol„ .1. Ser V.1 «p Gly Ser Tyr 

cy= Gly V.1 ser Gly v.l Ur «n PHe ser Tyr „U «P 

«p .1. TeS Gly Gl„ Gly Tyr Tyr Xle Ser 01, Oly Tyr Ser 
oly Til ser Tyr Ph. ser ser Met P.e Gly Leu «. PHe 
Thr Vll V.1 Phe Gl, HI ^-V »P ^' ^.1 V.l Cy= «, ser «J 
" III Hie .1. eye lie 11^ ser V.1 Tyr Leu Zl TKr Ly. Gl„ M. Leu 
cy. Gly ser Tyr Pne Oly «P .1| P- Ifo 
.He Gl, «n Hi. Met Lye THr III Tyr THr PHe .1. Glu Glu Ser 
«p V.1 ill Trp »P «n «n If. Leu V.l Gly Glu XI. Oly v.l Gly 
" II. V.1 Thr Thr 1^^ Ser Lye Leu Tyr Leu «n Glu Leu Ar, 

III Ph. V.1 Gl„ Al. Ill Ph. ser Tyr Al. Aep HI. Glu s.r Ph. Thr 
885 ^ °^ Piv His Leu Met Asn 

Glu Glu Gly ASP Gin Ala Arg Ala Phe Arg Ser Gly His L^^ 

900 Z!i Arrt CvB Ser Ser Thr His 

Leu ser Val Pro Val Gly Val Lys Phe Asp Arg Cys Se^ 

,0 pro Asn 'ril Tyr Ser Phe Me. 11^ Ala Tyr Xle Cys Asp Ala Tyr Arg 

..r III ser Gly Thr Gin III Thr Leu Leu Ser His Gin Glu Thr Trp 

545 , ^ u<a riv Val He Val Arg Gly 

?ir Thr ASP Ala Phe His Leu Ala Arg His Gly 

* rin Val Tvr Glv His Gly 

ser Met Tyr Ala Ser Leu Thr Ser Asn He Glu Val Ty ^^y 

980 Glv Leu Ser Ala Gly 

Arg Tyr Glu Tyr Arg Asp Thr^Ser Arg Gly Tyr Gly^L^^ 



25 



Ser Lys Val Arg Phe 
1010 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 505 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

30 ^jjj TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

rin Thr Leu Thr Val 
Glu He Met V.l Pro Gin Gly He Tyr A.p oly Glu 

35 Ph. pro Tyr tL val He Gly «P Pro ser Gly Thr Thr v.l Ph. 

S.r .1. Gly ^"lu Leu Thr Leu Ly. L„ L.u »ep ..n s.r He .1. M. 

35 
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Leu Pro Leu Ser Cys Phe Gly Asn Leu Leu Gly Ser Phe Thr Val Leu 

50 55 60 

Gly Arg Gly His Ser Leu Thr Phe Glu Asn lie Arg Thr Ser Thr Aan 
65 70 75 80 

Gly Ala Ala Leu Ser Asn Ser Ala Ala Asp Gly Leu Phe Thr lie Glu 
85 90 , 

Glv Phe Lys Glu Leu Ser Phe Ser Asn Cys Asn Ser Leu Leu Ala Val 
. 100 105 110 

Leu Pro Ala Ala Thr Thr Asn Lys Gly Ser Gin Thr Pro Thr Thr Thr 

115 120 125 

Ser Thr Pro Ser Asn Gly Thr He Tyr Ser Lys Thr Asp Leu Leu Leu 

130 135 140 

Leu Asn Asn Glu Lys Phe Ser Phe Tyr Ser Asn Leu Val Ser Gly Asp 
145 150 155 160 

Gly Gly Ala He Asp Ala Lys Ser Leu Thr Val Gin Gly He Ser Lys 
' 165 170 175 

1 n Leu Cys Val Phe Gin Glu Asn Thr Ala Gin Ala Asp Gly Gly Ala Cys 
180 185 190 

Gin Val Val Thr Ser Phe Ser Ala Met Ala Asn Glu Ala Pro He Ala 

195 200 205 

Phe Val Ala Asn Val Ala Gly Val Arg Gly Gly Gly He Ala Ala Val 

210 215 220 

Gin Asp Gly Gin Gin Gly Val Ser Ser Ser Thr Ser Thr Glu Asp Pro 
225 230 235 240 

Val Val Ser Phe' Ser Arg Asn Thr Ala Val Glu Phe Asp Gly Asn Val 

245 250 255 

Ala Arg Val Gly Gly Gly He Tyr Ser Tyr Gly Asn Val Ala Phe Leu 

260 265 270 

Asn Asn Gly Lys Thr Leu Phe Leu Asn Asn Val Ala Ser Pro Val Tyr 

275 280 285 

He Ala Ala Lys Gin Pro Thr Ser Gly Gin Ala Ser Asn Thr Ser Asn 

290 295 300 

Asn Tyr Gly Asp Gly Gly Ala He Phe Cys Lys Asn Gly Ala Gin Ala 
305 310 315 320 

,o Gly Ser Asn Asn Ser Gly Ser Val Ser Phe Asp Gly Glu Gly Val Val 
' 325 330 335 

Phe Phe Ser Ser Asn Val Ala Ala Gly Lys Gly Gly Ala He Tyr Ala. 

340 345 350 

Lys Lys Leu Ser Val Ala Asn Cys Gly Pro Val Gin Phe Leu Arg Asn 

355 360 365 

He Ala Asn Asp Gly Gly Ala He Tyr Leu Gly Glu Ser Gly Glu Leu 

370 375 380 

ser Leu Ser Ala Asp Tyr Gly Asp He He Phe Asp Gly Asn Leu Lys 
,c 385 390 395 400 

Arg Thr Ala Lys Glu Asn Ala Ala Asp Val Asn Gly Val Thr Val Ser 

^ 405 410 415 

ser Gin Ala He Ser Met Gly Ser Gly Gly Lys He Thr Thr Leu Arg 

420 425 430 

Ala Lys Ala Gly His Gin He Leu Phe Asn Asp Pro He Glu Met Ala 

435 440 '^^^ , ^ 

Asn Gly Aan Asn Gin Pro Ala Gin Ser Ser Lys Leu Leu Lys He Asn 
450 455 460 

■JO Asp Gly Glu Gly Tyr Thr Gly Asp He Val Phe Ala Asn Gly Ser Ser 
465 470 475 480 

Thr Leu Tyr Gin Asn Val Thr He Glu Gin Gly Arg He Val Leu Arg 
485 490 495 



Glu Lys Ala Lys Leu Ser Val Asp Ser 
500 505 

(2) INFORMATION FOR SEQ ID NO: 18; 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



- 80 - 



PENY3-594263.1 



10 



20 



22 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
AAGGGCCCAA TTACGCAGAG CTCGAGAGAA ATTATGGTTC CTCAAGGAAT TTACGAT 57 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CGCTCTAGAA CTAGTGGATC 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
ATGGTTCCTC AAGGAATTTA CG 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GGTCCCCCAT CAGCGG.GAG 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1515 base pairs 

*Q (B) TYPE: nucleic acid 

^ (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

_ _ PENY3-594263.1 



19 



60 

120 
180 
240 
300 
360 



GGTAGCCAGA 
GATCTTTTGT 
GGGGGAGCTA 
CAAGAAAATA 
ATGGCTAACG 
ATTGCTGCTG 
GTAGTAAGTT 
5 GGAGGGATTT 
AACAATGTTG 
AATACGAGTA 
GGATCCAATA 
AATGTAGCTG 
GGCCCTGTAC 
TCTGGAGAGC 
AGAACAGCCA 
TCGATGGGAT 



10 



15 



TTTAATGATC 
CTAAAAATTA 
ACTTTGTACC 
TTATCAGTGA 



CTCCGACGAC 

TACTCAATAA 

TAGATGCTAA 

CTGCTCAAGC 

AGGCTCCTAT 

TTCAGGATGG 

TTTCCAGAAA 

ACTCCTACGG 

CTTCTCCTGT 

ATAATTACGG 

ACTCTGGATC 

CTGGGAAAGG 

AATTTTTAAG 

TCAGTTTATC 

AAGAGAATGC 

CGGGAGGGAA 

CCATCGAGAT 

ACGATGGTGA 

AAAATGTTAC 

ATTCT 



AACATCTACA 

TGAGAAGTTC 

GAGCTTAACG 

TGATGGGGGA 

TGCCTTTGTA 

GCAGCAGGGA 

TACTGCGGTA 

GAACGTTGCT 

TTACATTGCT 

AGATGGAGGA 

AGTTTCCTTT 

GGGAGCTATT 

GAATATCGCT 

TGCTGATTAT 

TGCCGATGTT 

AATAACGACA 

GGCAAACGGA 

AGGATACACA 

GATAGAGCAA 



CCGTCTAATG 

TCATTCTATA 

GTTCAAGGAA 

GCTTGTCAAG 

GCGAATGTTG 

GTGTCATCAT 

GAGTTTGATG 

TTCCTGAATA 

GCTAAGCAAC 

GCTATCTTCT 

GATGGAGAGG 

TATGCCAAAA 

AATGATGGTG 

GGAGATATTA 

AATGGCGTAA 

TTAAGAGCTA 

AATAACCAGC 

GGGGATATTG 

GGAAGGATTG 



GTACTATTTA 

GTAATTTAGT 

TTAGCAAGCT 

TAGTCACCAG 

CAGGAGTAAG 

CTACTTCAAC 

GGAACGTAGC 

ATGGAAAAAC 

CAACAAGTGG 

GTAAGAATGG 

GAGTAGTTTT 

AGCTCTCGGT 

GAGCGATTTA 

XTTTCGATGG 

CTGTGTCCTC 

AAGCAGGGCA 

CAGCGCAGTC 

TTTTTGCTAA 

TTCTTCGTGA 



TTCTAAAACA 
CTCTGGAGAT 
TTGTGTCTTC 
TTTCTCTGCT 
AGGGGGAGGG 
AGAAGATCCA 
CCGAGTAGGA 
CTTGTTTCTC 
ACAGGCTTCT 
TGCGCAAGCA 
CTTTAGTAGC 
TGCTAACTGT 
TTTAGGAGAA 
GAATCTTAAA 
ACAAGCCATT 
TCAGATTCTC 
TTCCAAACTT 
TGGAAGCAGT 
AAAGGCAAAA 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3354 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 



ATGCAAACGT 
2 0 TTAAATGGGG 
TTAACTGTAT 
GCAGGAGAGT 
TTTGGGAACT 
AACATACGGA 
ACTATTGAGG 
CCTGCTGCAA 
GGTAC^TATTT 
AGTAATTCAG 
2 5 ATTAGCAAGC 
GTAGTCACCA 
GCAGGAGTAA 
TCTACTTCAA 
GGGAACGTAG 
AATGGAAAAA 
CCAACAAATG 
TGTAAGAATG 
GAGGGAGTAG 
30 AAAAAGCTCT 
GGTGGAGCGA 
ATGATTTTCG 
GTAACTGTGT 
GCTAAAGCAG 
CAGCCAGCGC 
ATTGTTTTTG 
ATTGTTCTTC 
CTGTATATGG 
35 CCTGCCGCTA 
GCAAACAATG 
GTCATTGGTA 
TTGGATGATA 
CTGAAATTAC 



CTTTCCATAA 

GGGGGTATGC 

CATTTCCCTA 

TAACGTTAAA 

TATTAGGGAG 

CTTCTACAAA 

GTTTTAAAGA 

CGACTAATAA 

ATTCTAAAAC 

TCTCTGGAGA 

TTTGTGTCTT 

GTTTCTCTGC 

GAGGGGGAGG 

CAGAAGATCC 

CCCGAGTAGG 

CCTTGTTTCr 

GACAGGCTTC 

GTGCGCAAGC 

TTTTCTTTAG 

CGGTTGCTAA 

TTTATTTAGG 

ATGGGAATCT 

CCTCACAAGC 

GGCATCAGAT 

AGTCTTCCGA 

CTAATGGAAA 

GTGAAAAGGC 

AAGCTGGGAG 

ATCAGTCGAT 

CAGTTACGAA 

GCACAACTGC 

CAGCTTATGA 

AGTTAGGGAC 



GTTCTTTCTT 
AGAAATCATG 
TACTGTTATA 
AAATCTTGAC 
TTTTACTGTT 
TGGAGCTGCA 
ATTATCTTTT 
TGGTAGCCAG 
AGATCTTTTG 
TGGGGGAGCT 
CCAAGAAAAT 
TATGGCTAAC 
GATTGCTGCT 
AGTAGTAAGT 
AGGAGGGATT 
CAACAATGTT 
TAATACGAGT 
AGCAGGATCC 
TAGCAATGTA 
CTGTGGCCCT 
AGAATCTGGA 
TAAAAGAACA 
CATTTCGATG 
TCTCTTTAAT 
ACCTCTAAAA 
CAGTACTTTG 
AAAATTATCA 
TACATTGGAT 
CACGCTTTCC 
TCCTCCTACC 
TGGTTCTGTT 
TAGGTATGAT 
TCAGCCCCCA 



TCAATGATTC 
GTTCCTCAAG 
GGAGATCCGA 
AATTCTATTG 
TTAGGGAGAG 
CTAAGTGACA 
TCCAATTGCA 
ACTCCGTCGA 
TTACTCAATA 
ATAGATGCTA 
ACTGCTCAAG 
GAGGCTCCTA 
GTTCAGGATG 
TTTTCCAGAA 
TACTCCTACG 
GCTTCTCCTG 
GATAATTACG 
AATAACTCTG 
GCTGCTGGGA 
GTACAACTCT 
GAGCTCAGTT 
GCCAAAGAGA 
GGATCGGGAG 
GATCCCATCG 
ATTAACGATG 
TACCAAAATG 
GTGAATTCTC 
TTTGTAACTC 
AATCTGCATT 
AATCCTCCAG 
ACAATTAGTG 
TGGCTAGGTT 
GCTAATGCCC 



TAGCTTATTC 

GAATTTACGA 

GTGGGACTAC 

CAGCTTTGCC 

GACACTCGTT 

GCGCTAATAG 

ACCCATTACT 

CAACATCTAC 

ATGAGAAGTT 

AGAGCTTAAC 

CTGATGGGGG 

TTGCCTTTGT 

GGCAGCAGGG 

ATACTGCGGT 

GGAACGTTGC 

TTTACATTGC 

GAGATGGAGG 

GATCAGTTTC 

AAGGGGGAGC 

TAGGGAATAT 

TATCTGCTGA 

ATGCTGCCGA 

GGAAAATAAC 

AGATGGCAAA 

GTGAAGGATA 

TTACGATAGA 

TAAGTCAGAC 

CACAACCACC 

TGTCTCTTTC 

CGCAAGATTC 

GGCCTATCTT 

CTAATCAAAA 

CATCAGATTT 



TTGCTGCTCT 
TGGGGAGACG 
TGTTTTTTCT 
TTTAAGTTGT 
GACTTTCGAG 
CGGGTTATTT 
TGCCGTACTG 
ACCGTCTAAT 
CTCATTCTAT 
GGTTCAAGGA 
AGCTTGTCAA 
AGCGAATGTT 
AGTGTCATCA 
AGAGTTTGAT 
TTTCCTGAAT 
TGCTGAGCAA 
AGCTATCTTC 
CTTTGATGGA 
TATTTATGCC 
CGCTAATGAT 
TTATGGAGAT 
TGTTAATGGC 
GACATTAAGA 
CGGAAATAAC 
CACAGGGGAT 
GCAAGGAAGG 
AGGTGGGAGT 
ACAACAGCCT 
TTCTTTGTTA 
TCATCCTGCA 
TTTTGAGGAT 
AATCGATGTC 
GACTCTAGGG 



420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320. 
1380 
1440 
1500 
1515 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
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CCTGAGCGAG TAJCTTCT ^qcAAGTOTG JJJ^cGATG CTTTAGGTCA ^.^TGOT 234U 

SmSwS CT0»^? S^ScT iCC^»« SSoC^JT 

mmmmwm i 

CATACATTAT CATTT ^^^^^.^ 
t2) INFOBMATION FOR SEQ 



20 



25 



acgttaactg raaaaatctt. J ^j^ggga g^°°^c5taa tagcgggtta 

?CTGCAGGAG AGTT^^^ GAGTTTTACT |ItcTAAGTG ACAGCGCTAft ^^gCCGTA ^qQ 
^StTTTGGGA ^^TTATTA^ AAATGGAGCT GCACT GCAACTCATT ^^cCGTCT 

GGCGTAACTG ^^^^ G^IISSa AAAATTAACG ^JGGTG^ agagcaagga 

AGAGCTAAAG GA^r^ cGAACCTCTA "^^cCAAA P^TGTaft^ gacaggtggg ^ 

ScTGCCG CTJ^T^Jf^ g^^ATCCTCCT ACCAATCCT 
TTAGCAAACA ATGCAV, _ 83 ' 



10 



15 



20 



25 



30 



GCAGTCATTG 
GATTTGGATG 
GTCCTGAAAT 
GGGAATGAGA 
ACAGCAAATA 
GGGCCTGAGC 
CGATCTGCGC 
TGGGTTTCTG 
CGGTATATTA 
GGTCTAGCAT 
CATCATGCTT 
TTGTTCGGAG 
TCATACACAT 
ATTGGAGTGG 
CCTTTCGTGC 
CAAGCTCGGG 
TTTGATCGAT 
GATGCTTATC 
ACAACAGATG 
TCTCTAACAA 
CGAGGTTATG 
TAAGTGTTAG 
TGTTCCTATT 
CCTTTTAAGG 
GGGGCAGAGA 
AACCATGCAT 



GTAGCACAAC 
ATACAGCTTA 
TACAGTTAGG 
TGCCTAAGTA 
ATGGTCCTTA 
GAGTAGCTTC 
ATTCAGCAAT 
GAGTTTCGAA 
GTGGGGGTTA 
TTACCGAAGT 
GCATAGGATC 
ATGCGTTTAT 
TTGCAGAGGA 
GATTACCGAT 
AAGCTGAGTT 
CATTCAGGAG 
GTTCTAGTAC 
GCACCATCTC 
CCTTTCATTT 
GCAATATAGA 
GTTTGAGTGC 
CGATGCCTTT 
CGTATGGATT 
GGGACGATGT 
ACGGCTCAAT 
TATCATTTAC 



TGCTGGTCCT 
TGATAGGTAT 
GACTCAGCCC 
TGGCTATCAA 
TACTCTGAAA 
TTTGGTTCCA 
TCAAGCAAGT 
TTTCTCCTAT 
TTCCTTAGGA 
ATTTGGTAGA 
CGTTTATCTA 
CCGTGCTAGC 
GAGCGATGTT 
TGTGACTACT 
TTCTTATGCC 
TGGTCATCTC 
ACACCCTAAT 
TGGGACTCAG 
GGCAAGACAT 
AGTATATGGC 
AGGAAGTAAA 
TTCTTTGAGA 
CGCGAGCTCT 
TTACTTGAAT 
TATCTCAGCT 
AGAT 



GTCACAATTA 
GATTGGCTAG 
TCAGCTAATG 
GGAAGCTGGA 
GCTACATGGA 
AATAGTTTAT 
GTGGATGGGC 
CATGACCGCG 
GCAAACTCCT 
TCTAAAGATT 
TCTACCAAAC 
TACGGGTTTG 
CGTTGGGATA 
CCATCTAAGC 
GATCATGAAT 
ATGAATCTAT 
AAATATAGCT 
ACAACACTCC 
GGAGTCATAG 
CATGGAAGAT 
GTCCGGTTCT 
TCTACATCAT 
CCTCAAGTGT 
GGAGACTGCG 
AATGGCGACA 



GTGGGCCTTT 
GTTCTAATCA 
CCCCATCAGA 
AGCTTGCGTG 
CTAAAACTGG 
GGGGATCCAT 
GCTCTTATTG 
ATGCTTTAGG 
ACTTTGGATC 
ATGTAGTGTG 
AAGCTTTATG 
GGAACCAGCA 
ATAACTGTCT 
TCTATTTGAA 
CTTTTACAGA 
CAGTTCCTGT 
TTATGGGGGC 
TATCCCATCA 
TTAGAGGGTC 
ATGAGTATCG 
AAAAATATTG 
TTTGTTTTTT 
TAACACCTAA 
CTTTAGTCAA 
ATTTAACGAT 



CTTTTTTGAG 
AAAAATCGAT 
TTTGACTCTA 
GGATCCTAAT 
GTATAATCCT 
TTTAGATATA 
TCGAGGATTA 
TCAGGGATAT 
ATCGATGTTT 
TCGTTCCAAT 
TGGATCCTAT 
TATGAAAACC 
GGTTGGAGAG 
TGAGTTGCGT 
GGAAGGCGAT 
TGGAGTAAAA 
TTATATCTGT 
AGAGACATGG 
TATGTATGCT 
AGATACTTCT 
GTTAGATAGT 
AGCTTGTTTG 
TGTAACCACT 
TGTCTATGCA 
TACCGGACAA 



35 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH? 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Pro Tyr Thr Val lie Gly Asp Pro Ser Gly Thr Thr Val Phe Ser Ala 

1 5 10 15 

Gly Glu Leu Thr Leu Lys Asn Leu Asp Asn Ser lie Ala Ala Pro Leu 

20 25 30 

Ser Cys Phe Gly Asn Leu Leu Gly Ser Phe Thr Val Leu Gly Arg Gly 

35 40 45 

His Ser Leu Thr Phe Glu Asn He Arg Thr Ser Thr Asn Gly Ala Ala 
50 55 60 

Leu 
65 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Ala Ala Asn Gin Leu He Thr Leu Ser Asn Leu His Leu Ser Leu Ser 

15 10 15 

Ser Leu Leu Ala Asn Asn Ala Val 
20 



1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3324 
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(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Gly Tyr Thr Gly Asp lie Val Phe 
1 5 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Tyr Gly Asp lie lie Phe Asp 
1 5 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

Gly Tyr Ala Ala Glu He Met Val Pro Gin Gly He Tyr Asp Gly Glu 

1 5 10 15 

Thr Leu Thr Val Ser Phe Pro Tyr Thr Val He Gly Asp Pro Ser Gly 

20 25 30 

Thr Thr Val Phe Ser Ala Gly Glu Leu Thr Leu Lys Asn Leu Asp Asn 

35 40 45 

Ser He Ala Ala Leu Pro Leu Ser Cys Phe Gly Asn Leu Leu Gly 
50 55 60 

( 2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Met Ala Asn Gly Asn Asn Gin Pro Ala Gin Ser Ser Lys Leu Leu Lys 
15 10 15 



lie Asn Asp Gly Glu Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Ala Asn Gly Ser Ser Thr Leu Tyr Gin Asn Val Thr He Glu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Lys Leu Ser Val Asn Ser Leu Ser Gin Thr 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

val He Gly Ser Thr Thr Ala Gly Ser Val Thr He Ser Gly Pro He 

1 5 10 15 

Phe Phe Glu Asp Leu Asp Asp Thr Ala Tyr Asp Arg Tyr Asp Trp Leu 

20 25 30 

Gly Ser Asn Gin Lys He Asn Val Leu Lys Leu Gin Leu 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 cimino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Val He Gly Ser Thr Thr Ala Gly Ser Val Thr He Ser Gly Pro He 
15 10 15 



Phe Phe Glu Asp Leu Asp Asp Thr Ala Tyr Asp Arg Tyr Asp Trp Leu 

20 25 
Gly ser Asn Gin Lys He Asn Val Leu Lys Leu Gin Leu Gly Thr Lys 

35 40 45 

pro Pro Ala Asn Ala Pro Ser Asp Leu Thr Leu Gly Asn Glu Met Pro 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

{ D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Asp Pro Asn Thr Ala Asn Asn Gly Pro Tyr 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 458 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Gly Gly Ala Cys Gin Val Val Thr Ser Phe Ser Ala Met Ala Asn Glu 

Ala Pro He Ala Phe Val Ala Asn Val Ala Gly Val Arg Gly Gly. Gly 

20 25 I. 

lie Ala Ala Val Gin Asp Gly Gin Gin Gly Val Ser Ser Ser Thr Ser 

35 40 45 

Thr Glu Asp pro Val Val Ser Phe Ser Arg Asn Thr Ala Val Glu Phe 

en 55 6 

Asp Gly Asn Val Ala Arg Val Gly Gly Gly lie Tyr Ser Tyr Gly Asn 

val Ala Phe Leu Asn Isn Gly Lys Thr Leu Phe Leu Asn Asn Val Ala 

85 90 
ser pro Val Tyr He Ala Ala Lys Gin Pro Thr Ser Gly Gin Ala Ser 
100 105 



Ash Thr ser Asn Asn Tyr Gly Asp Gly Gly Ala He Phe Cys Lys Asn 

Gly Ala cln Ala Gly Ser Asn III Ser Gly Ser Val Ser Phe Asp Gly 

1 -10 135 
Glu Gly val val Phe Phe Ser Ser Asn Val Ala Ala Gly Lys Gly Gly 
1 150 1 
Ala lie Tyr Ala Lys Lys Leu Ser Val Ala Asn Cys Gly Pro Val Gin 

Phe Leu Arg Asn He Ala Asn Asp Gly Gly Ala He Tyr Leu Gly Glu 
180 185 1^" 



i HO X o -J 

ser Gly Glu Leu Ser Leu Ser Ala Asp Tyr Gly Asp lie He Phe Asp 

Ala 
220 
Ser 

00^ 230 235 

Thr Thr Leu Arg Ala Lys Ala Gly His Gin He Leu Phe Asn Asp Pro 
245 250 



195 200 

Gly Asn Leu Lys Arg Thr Ala Lys Glu Asn Ala Ala Asp Val Asn Gly 

9in 215 

val Thr val Ser Ser Gin Ala He Ser Met Gly Ser Gly Gly Lys He 



lie Glu Met Ala Asn Gly Asn Asn Gin Pro Ala Gin Ser Ser Lys Leu 

265 270 



260 ^ — 

Leu Lys He Asn Asp Gly Glu Gly Tyr Thr Gly Asp lie Val Phe Ala 

275 280 285 

Asn Gly ser Ser Thr Leu Tyr Gin Asn Val Thr He Glu Gin Gly Arg 

290 295 300 

lie val Leu Arg Glu Lys Ala Lys Leu Ser Val Asn Ser Leu Ser Gin 
305 310 315 •'•'O 

* Thr Gly Gly Ser Leu Tyr Met Glu Ala Gly Ser Thr Trp Asp Phe Val 

325 330 335 

Thr Pro Gin Pro Pro Gin Gin Pro Pro Ala Ala Asn Gin Leu He Thr 

340 345 350 

Leu Ser Asn Leu His Leu Ser Leu Ser Ser Leu Leu Ala Asn Asn Ala 

355 360 365 

Val Thr Asn Pro Pro Thr Asn Pro Pro Ala Gin Asp Ser His Pro Ala 
370 375 
10 val He Gly Ser Thr Thr Ala Gly Ser Val Thr He Ser Gly Pro lie 
385 390 395 400 

Phe Phe Glu Asp Leu Asp Asp Thr Ala Tyr Asp Arg Tyr Asp Trp Leu 

405 410 415 

Gly Ser Asn Gin Lys He Asn Val Leu Lys Leu Gin Leu Gly Thr Lys 

^ 420 425 430 

Pro Pro Ala Asn Ala Pro Ser Asp Leu Thr Leu Gly Asn Glu Met Pro 

435 440 445 

Lys Tyr Gly Tyr Gin Gly Ser Trp Lys Leu 
450 455 



15 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 325 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

( D ) TOPOLOGY : unknown 



20 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 

Leu Lys Ala Thr Trp Thr Lys Thr Gly Tyr Asn Pro Gly Pro Glu Arg 

1 5 10 15 

Val Ala ser Leu Val Pro Asn Ser Leu Trp Gly Ser lie Leu Asp He 
20 25 30 

9c Arg ser Ala His Ser Ala He Gin Ala Ser Val Asp Gly Arg Ser Tyr 
^ 40 45 



35 

Cys Arg Gly Leu Trp Val Ser Gly Val Ser Asn Phe Phe Tyr His Asp 

Arg a2p Ala Leu Gly Gin Gly Tyr Arg Tyr He Ser Gly Gly Tyr S^r 
• 65 70 . 75 , 

Leu Gly Ala Asn Ser Tyr Phe Gly Ser Ser Met Phe Gly Leu Ala Phe 

85 90 95 

Thr Glu val Phe Gly Arg Ser Lys Asp Tyr Val Val Cys Arg Ser Asn 

100 105 11" 

His His Ala Cys He Gly Ser Val Tyr Leu Ser Thr Gin Gin Ala Leu 

115 120 125 

Cys Gly Ser Tyr Leu Phe Gly Asp Ala Phe He Arg Ala Ser Tyr Gly 

130 135 140 

Phe Gly Asn Gin His Met Lys Thr Ser Tyr Thr Phe Ala Glu Glu Ser 
145 150 155 -"-O" 

Asp val Arg Trp Asp Asn Asn Cys Leu Ala Gly Glu He Gly Ala Gly 
165 170 = 

-iS Leu Pro He Val He Thr Pro Ser Lys Leu Tyr Leu Asn Glu Leu Arg 
180 185 
Pro Phe val Gin Ala Glu Phe Ser Tyr Ala Asp His Glu Ser Phe Thr 
195 200 205 
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Glu Glu Gly Asp Gin Ala Arg Ala Phe Lys S r Gly His Leu Leu Asn 

210 215 220 

Leu Ser Val Pro Val Gly Val Lys Phe Asp Arg Cys Ser Ser Thr His 

225 230 235 240 

Pro Asn Lys Tyr Ser Phe Met Ala Ala Tyr He Cys Asp Ala Tyr Arg 

245 250 255 

Thr He Ser Gly Thr Glu Thr Thr Leu Leu Ser His Gin Glu Thr Trp 

260 265 270 

Thr Thr Asp Ala Phe His Leu Ala Arg His Gly Val Val Val Arg Gly 

275 280 285 
ser Met Tyr Ala Ser Leu Thr Ser Asn He Glu Val Tyr Gly Hrs Gly 

290 295 300 
Arg Tyr Glu Tyr Arg Asp Ala Ser Arg Gly Tyr Gly Leu Ser Ala Gly 

Ser Arg Val Arg Phe 

325 



10 



15 



20 



25 



30 



35 
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vmUT Tg CLAIMED IS; 

1. An isolated Chlamydia species HMW prot in 
wherein the apparent molecular weight is about 105-115 kDa, 

5 as determined by SDS-PAGE, or a fragment or analogue thereof. 

2. The protein of claim 1 which is substantially 

purified. 

10 3. The protein of claim 1 wherein the chlamydia 

species is Chlamydia trachomatis. Chlamydia pecorum. 
Chlamydia psittaci or Chlamydia pneumoniae. 

4. The protein of claim 1 having an amino acid 
15 sequence shown in SEQ ID No. : 2, 15 or 16 or a fragment or 

conservatively substituted analogue thereof. 

5 . The fragment of claim 4 , having an amino acid 
sequence shown in SEQ ID No.: 3, 17 or 25-37. 

20 

6. The protein of claim 1 recognizable by an 
antibody preparation that specifically binds to a peptide 
having an amino acid sequence of SEQ ID No.: 2, 15 or 16 or a 
fragment or conservatively substituted analogue thereof . 

25 

7. An isolated nucleic acid molecule encoding the 
HMW protein of claim 1 or a fragment or an. analogue thereof. 

8. The nucleic acid molecule of claim 7 wherein 
30 the Chlamydia species is Chlamydia trachomatis. Chlamydia 

pecorum. Chlamydia psittaci or Chlamydia pneumoniae. 

9 . The nucleic acid molecule of claim 7 wherein 
the encoded protein has the amino acid sequence of SEQ ID No: 

35 2, 15 or 16 or a fragment or conservatively substituted 
analogue thereof. 
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10. An isolated nucleic acid molecule having a 
sequence selected from the group consisting of: 

a) a DNA sequence of SEQ ID No.: 1, 23 or 24, or 
a complementary sequence or fragment thereof; 
5 b) a DNA sequence encoding a HMW protein having 

the amino acid sequence of SEQ ID No. : 2, 15 
or 16 or fragment thereof; 

c) a DNA sequence encoding a deduced amino acid 
sequence of SEQ ID No. : 2, 15 or 16 or the 

10 complimentary or degenerate sequence thereto 

or fragment thereof; and 

d) a nucleic acid sequence which hybridizes under 
stringent conditions to any one of the 
sequences defined in a) , b) or c) . 

15 

11. A recombinant expression vector adapted for 
transformation of a host comprising the nucleic acid molecule 
of claim 7 or 10. 



20 



25 



12. A recombinant expression vector adapted for 
transformation of a host comprising the nucleic acid molecule 
of claim 7 or 10 and expression means operatively coupled to 
the nucleic acid molecule for expression by the host of HMW 
protein or a fragment or analogue thereof. 

13. The expression vector of claim 12, wherein the 
expression means includes a nucleic acid portion encoding a 
leader sequence for secretion from the host of the HMW 
protein or a fragment or analogue thereof. 

14. A transformed host cell containing an 
expression vector of claim 12. 

15. A transformed host cell containing an 
35 expression vector of claim 13. 



30 
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16. An isolated recombinant protein or fragment or 
analogue thereof producible by the transformed host of claim 
14. 



5 17. An isolated recombinant protein or fragment or 

analogue thereof producible by the transformed host of claim 
IS- 
IS. A recombinant vector for delivery of a HMW 
10 protein or fragment or analogue thereof to a host comprising 
the nucleic acid molecule of claim 7 or 10. 

19. An immunogenic composition, comprising at 
least one component selected from the group consisting of: 
15 a) an isolated HMW protein, wherein the apparent 

molecular weight is about 105-115 kDa, as 
determined by SDS-PAGE, or a fragment or 
conservatively substituted analogue thereof; 

b) an isolated nucleic acid molecule encoding a 
20 HMW protein of a) or a fragment or analogue 

theirepf ; 

c) an isolated nucleic acid molecule having the 
sequence of SEQ ID No.: 1, 2 3 or 24, the 
complimentary sequence thereto or a nucleic 

2 5 acid sequence which hybridizes under stringent 

conditions thereto or fragment thereof; 

d) an isolated recombinant protein or fragment or 
analogue thereof producible in a transformed 
host comprising an expression vector 

30 comprising a nucleic acid molecule as defined 

in b) or c) and expression means operatively 
coupled to the nucleic acid molecule for 
expression by the host qf said HMW protein or 
the fragment or analogue thereof; 

35 e) a recombinant vector comprising a nucleic acid 

sequence of b) or c) encoding a HMW protein or 
fragment or analogue thereof ; and 

_ 92 - PENY3-594263.1 



f) a transforined cell comprising the vector of 
e), 

and optionally an adjuvant, and a pharmaceutically acceptable 
carrier or diluent therefor, said composition producing an 
5 immune response when administered to a host, 

20. An antigenic composition, comprising at least 

one component selected from the group consisting of: 

a) an isolated HMW protein, wherein the apparent 
10 molecular weight is about 105-115 kDa as 

determined by SDS-PAGE, or a fragment or 
analogue thereof; 

b) an isolated nucleic acid molecule encoding a 
HMW protein of a) , or a fragment or analogue 

15 thereof ; 

c) an isolated nucleic acid molecule having the 
sequence of SEQ ID No.: 1, 22, 23 or 24, the 
complimentary or degenerate sequence thereto 
or a nucleic acid sequence which hybridizes 

20 under stringent conditions thereto; 

d) an isolated recombinant protein or fragment or 
analogue thereof producible in a transformed 
host comprising an expression vector 
comprising a nucleic acid molecule as defined 

25 in b) or C) and expression means operatively 

coupled to the nucleic acid molecule for 
expression by the host of said HMW protein or 
the fragment or analogue thereof; 

e) a recombinant vector, comprising a nucleic 
30 acid sequence of b) or c) encoding a HMW 

protein or fragment or analogue thereof; and 

f) a transformed cell comprising the vector of 
e), 

and optionally an adjuvant, said composition producing an 
35 immune response when administered to a host. 
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21. A method of producing an immune response in an 
animal comprising administering to said animal an effective 
amount of the antigenic composition of claim 20 or the 
immunogenic composition of claim 19. 

5 

22. The method of claim 21 wherein the animal is a 
mammal or a bird. 

23. Antisera raised against the antigenic 

10 composition of claim 20 or the immunogenic composition of 
claim 19. 

24. Antibodies present in the antisera of claim 23 
that specifically bind a HMW protein or a fragment or 

15 analogue thereof. 

25. A diagnostic reagent selected from the group 
consisting of: the protein of claim 1, the nucleic acid 
molecule of claim 10, the immunogenic composition of claim 

20 20, the antigenic composition of claim 19, the antisera of 
claim 23, the vector of claim 12, the transformed cell of 
claim 14, and the antibodies of claim 24. 

26. A method for detecting anti-Chlamydia 
25 antibodies in a test sample comprising the steps of: 

a) contacting said sample with the HMW protein of 
claim 1, the antigenic composition of claim 20 
or the immunogenic compositipn of claim .19 to 
form, in the presence of said antibodies, 

30 Chlamydia antigen: anti-Chlamydia antibody 

immunocomplexes, and further, 

b) either detecting the presence of or measuring 
the amount of said immunocomplexes formed 
during step a) as an indication of the 

35 presence of said anti-Chlamydia antibodies in 

the test sample. 
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27. A diagnostic kit for detecting antibodies to 
Chlamydia, said kit comprising the HMW protein of claim 1, 
the antigenic composition of claim 20 or the immunogenic 
composition of claim 19, a container means for contacting 

5 said protein or composition with a test sample suspected of 
having said antibodies and reagent means for detecting or 
measuring Chlamydia antigen: anti-Chlamydia antibody 
immunocomplexes formed between said protein or composition 
and said antibodies. 

10 

28. A method for detecting the presence of 
Chlamydia in a test sample comprising the steps of: 

a) contacting said test sample with the 
antibodies of claim 24 for a time sufficient 

15 to allow said antibodies to bind Chlamydia, if 

present, and to form a Chlamydia: anti- 
Chlamydia antibody immunocomplexes, and 
further , 

b) either detecting the presence of or measuring 
20 the amount of said immunocomplexes formed 

during step a) as an indication of the 
presence of said Chlamydia in the test sample. 

29. A diagnostic kit for detecting the presence of 
25 Chlamydia, said kit comprising the antibodies of claim 24, 

container means for contacting said antibodies with a test 
sample suspected of having said Chlamydia and reagent means 
for detecting or measuring Chlamydia: anti-Chlamydia antibody 
immunocomplexes formed between said antibodies and said 
30 Chlamydia, 

30. A pharmaceutical composition comprising an 
effective amount of at least one component selected from the 
group consisting of: 

35 a) a HMW protein, wherein the apparent molecular 

weight is about 105-115 kDa, as determined by 
SDS-PAGE, or a fragment or analogue thereof; 
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b) an isolated nucleic acid molecule encoding a 
HMW protein of a) , or a fragment or analogue 
thereof ; 

c) an isolated nucleic acid molecule having the 
5 sequence of SEQ ID No.: 1, 23 or 24, the 

complimentary or degenerate sequence thereto 
or a nucleic acid sequence which hybridizes 
under stringent conditions thereto; 

d) an isolated recombinant HMW protein or 

10 fragment or analogue thereof producible in a 

transformed host comprising an expression 
vector comprising a nucleic acid molecule as 
defined in b) or c) and expression means 
operatively coupled to the nucleic acid 

15 molecule for expression by the host of said 

HMW protein or the fragment or analogue 
thereof ; 

e) a recombinant vector, comprising a nucleic 
acid sequence of b) or c) encoding a HMW 

20 protein or fragment or analogue thereof; 

f ) a transformed cell comprising the vector of 
e) , and 

g) antibodies that specifically bind the 
component of a) , b) , c) , d) , e) or f ) , 

25 and optionally a pharmaceutical ly acceptable carrier or 
diluent therefor. 

31. A vaccine composition comprising an effective 
amount of at least one component selected from the group 

30 consisting of: 

a) a HMW protein, wherein the apparent molecular 
weight is about 105-115 kDa, as determined by 
SDS^PAGE, or a fragment or analogue thereof ; 

b) an isolated nucleic acid molecule encoding a 
35 HMW protein of a) or a fragment or analogue 

thereof; 
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c) an isolated nucleic acid molecule having the 
sequence of SEQ ID No.: 1, 23 or 24, the 
complimentary or degenerate sequence thereto 
or a nucleic acid sequence which hybridizes 

5 under stringent conditions thereto; 

d) an isolated recombinant HMW protein or 
fragment or analogue thereof producible in a 
transformed host comprising an expression 
vector comprising a nucleic acid molecule as 

10 defined in b) or c) and expression means 

operatively coupled to the nucleic acid 
molecule for expression by the host of said 
HMW protein or the fragment or analogue 
thereof ; 

15 e) a recombinant vector, comprising a nucleic 

acid sequence of b) or c) encoding a HMW 
protein or fragment or analogue thereof; 
f ) a transformed cell comprising the vector of 
e) , and 

20 g) antibodies that specifically bind the 

component of a) , b) , c) , d) , e) , or f ) , 
and optionally an adjuvant, and a pharmaceutical ly acceptable 
carrier or diluent therefor, wherein the vaccine produces an 
immune response when administered to a host. 

25 

32. A method of preventing, treating or 
ameliorating a disorder related to Chlamydia in a host in 
need of such treatment comprising administering to said host, 
an effective amount of the pharmaceutical composition of 

30 claim 30 or the vaccine composition of claim 31. 

33. The method of claim 32, wherein the disorder 
is selected from the group consisting of a Chlamydia 
bacterial infection, conjunctivitis, urethritis, 
lymphogranuloma venereum (LGV) , cervicitis, epididymitis, 

35 endometritis, pelvic inflammatory disease (PID) , salpingitis, 
tubal occlusion, infertility, cervical cancer, 
arteriosclerosis and atherosclerosis. 
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34. The method of claim 33 wherein the host is a 
bird or a mammal. 

35. The composition of any one of claims 19, 20, 
5 30 or 31 formulated for in vivo administration to a host to 

confer protection against disease caused by a species of 
Chlamydia. 

36. The composition of any one of claims 19, 20, 
10 30, or 31 wherein the species is selected from the group 

consisting of Chlamydia trachomatis^ Chlamydia pecorum. 
Chlamydia psittaci and Chlamydia pneumoniae. 

37. The composition of any one of claims 19, 20, 
15 30 or 31 formulated as a microparticle, capsule, or liposome 

preparation. 

38. The protein of any one of claims 1, 4, 6, 16 
and 17, wherein the protein binds to heparin or heparan- 

20 sulfate. 

39. The protein of any one of claims 1, 4, 6, 16 
or 17, wherein the protein is an outer membrane protein. 

25 40. A method for determining the presence of 

nucleic acid encoding a HMW protein or a fragment or analogue 
thereof in a sample, comprising the steps of: 

a) contacting the sample with the nucleic acid 
molecule of claim 7 or 10 or any fragment 
30 thereof or complementary thereto to produce 

duplexes comprising the nucleic acid molecule 
and any said nucleic acid molecule encoding 
the HMW protein in the sample and specifically 
hybridizable therewith; and 
35 b) determining the production of duplexes. 
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41. A diagnostic kit for determining the presence 
of nucleic acid encoding a HMW prot in or fragment or 
analogue thereof in a sample, comprising: 

a) the nucleic acid molecule of claim 7 or 10 or 
5 any fragment thereof or complementary thereto; 

b) means for contacting the nucleic acid with the 
sample to produce duplexes comprising the 
nucleic acid molecule and any said nucleic 
acid encoding the HMW protein in the sample 

10 and specifically hybridizable therewith; and 

c) means for determining the production of 

duplexes. 



15 
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ABSTRACT OF THE DI8 CL08CRE 

A high molecular weight ("HMW") protein of 
Chlamydia, the amino acid sequence thereof, and antibodies 
that specifically bind the HMW protein are disclosed as well 
5 as the nucleic acid sequence encoding the same. Also 
disclosed are prophylactic and therapeutic compositions, 
comprising the HMW protein, a fragment thereof, or an 
antibody that specifically binds the HMW protein or a portion 
thereof, or the nucleotide sequence encoding the HMW protein 
10 or a fragment thereof, including vaccines. 
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Figure 2. Nucleic acid sequence of HMW protein 



GGGCAAAACTCTTCCCCCCGGGATTTATATGGGAAAGGGGAAACTTTGGC 
CCGTATTCAAGCGCCACGGGTTTTGGGGCGGAATGAATTTTTTCGTTCCG 
GAAAAAGTAATTCCCCGGGAACGTAGGGTATCGGTTTCATAGGCTCGCCA 
AATGGGATATAGGTGGAAAGGTAAAAAAAACTGAGCCAAGCAAAGGATAG 
AGAAGTCTTGTAATCATCGCAGGTTAAAGGGGGGATGTTATTTTAGCCTG 
CAAATAGTGTAATTATTGGATCCTGTAAAGAGAAAAGGACGAATGCGCTG 
AAGATAAGAACATTTATTGATATTAAATTATTAATTTTTTATGAAGCGGA 
GTAATTAATTTTATCTCTCAGCTTTTGTGTGATGCAAACGTCTTTCCATA 
AGTTCTTTCTTTCAATGATTCTAGCTTATTCTTGCTGCTCTTTAAATGGG 
GGGGGATATGCAGCAGAAATCATGGTTCCTCAAGGAATTTACGATGGGGA 
GACGTTAACTGTATCATTTCCCTATACTGTTATAGGAGATCCGAGTGGGA 
CTACTGTTTTTTCTGCAGGAGAGTTAACATTAAAAAATCTTGACAATTCT 
ATTGCAGCTTTGCCTTTAAGTTGTTTTGGGAACTTATTAGGGAGTTTTAC 
TGTTTTAGGGAGAGGACACTCGTTGACTTTCGAGAACATACGGACTTCTA 
CAAATGGGGCAGCTCTAAGTAATAGCGCTGCTGATGGACTGTTTACTATT 
GAGGGTTTTAAAGAATTATCCTTTTCCAATTGCAATTCATTACTTGCCGT 
ACTGCCTGCTGCAACGACTAATAAGGGTAGCCAGACTCCGACGACAACAT 
CTACACCGTCTAATGGTACTATTTATTCTAAAACAGATCTTTTGTTACTC 
AATAATGAGAAGTTCTCATTCTATAGTAATTTAGTCTCTGGAGATGGGGG 
AGCTATAGATGCTAAGAGCTTAACGGTTCAAGGAATTAGCAAGCTTTGT6 
TCTTCCAAGAAAATACTGCTCAAGCTGATGGGGGAGCTTGTCAAGTAGTC 
ACCAGTTTCTCTGCTATGGCTAACGAGGCTCCTATTGCCTTTGTAGCGAA 
TGTTGCAGGAGTAAGAGGGGGAGGGATTGCTGGTGTTCAGGATGGGCAGC 
AGGGAGTGTCATCATCTACTTCAACAGAAGATCCAGTAGTAAGTTTTTCC 
AGAAATACTGCGGTAGAGTTTGATGGGAACGTAGCCCGAGTAGGAGGAGG 
GATTTACTCCTACGGGAACGTTGCTTTCCTGAATAATGGAAAAACCTTGT 
TTCTCAACAATGTTGCTTCTCCTGTTTACATTGCTGCTAAGCAACCAACA 
AGTGGACAGGCTTCTAATACGAGTAATAATTACGGAGATGGAGGAGCTAT 
CTTCTGTAAGAATGGTGCGCAAGCAGGATCCAATAACTCTGGATCAGTTT 
CCTTTGATGGAGAGGGAGTAGTTTTCTTTAGTAGCAATGTAGCTGCTGGG 
AAAGGGGGAGCTATTTATGdCAAAAAGCTCTCGGTTGCTAACTGTGGCCC 
TGTACAATTTTTAAGGAATATCGCTAATGATGGTGGAGCGATTTATTTAG 
, GAGAATCTGGAGAGCTCAGTTTATCTGCTGATTATGGAGATATTATTTTC 
GATGGGAATCTTAAAAGAACAGCCAAAGAGAATGCTGCCGATGTTAATGG 
CGTAACTGTGTCCTCACAAGCCATTTCGATGGGATCGGGAGGGAAAATAA 



CGACATTAAGAGCTAAAGCAGGGCATCAGATTCTCTTTAATGATCCCATC 

GAGATGGCAAACGGAAATAACCAGCCAGCGCAGTCTTCCAAACTTCTAAA 

AATTAACGATGGTGAAGGATACACAGGGGATATTGTTTTTGCTAATGGAA 

GCAGTACTTTGTACCAAAATGTTACGATAGAGCAAGGAAGGATTGTTCTT 

CGTGAAAAGGCAAAATTATCAGTGAATTCTCTAAGTCAGACAGGTGGGAG 

TCTGTATATGGAAGCTGGGAGTACATGGGATTTTGTAACTCCACAACCAC 

CACAACAGCCTCCTGCCGCTAATCAGTTGATCACGCTTTCCAATCTGCAT 

TTGTCTCTTTCTTCTTTGTTAGCAAACAATGCAGTTACGAATCCTCCTAC 

CAATCCTCCAGCGCAAGATTCTCATCCTGCAGTCATTGGTAGCACAACTG 

CTGGTTCTGTTACAATTAGTGGGCCTATCTTTTTTGAGGATTTGGATGAT 

ACAGCTTATGATAGGTATGATTGGCTAGGTTCTAATCAAAAAATCAATGT 

CCTGAAATTACAGTTAGGGACTAAGCCCCCAGCTAATGCCCCATCAGATT 

TGACTCTAGGGAATGAGATGCCTAAGTATGGCTATCAAGGAAGCTGGAAG 

CTTGCGTGGGATCCTAATACAGCAAATAATGGTCCTTATACTCTGAAAGC 

TACATGGACTAAAACTGGGTATAATCCTGGGCCTGAGCGAGTAGCTTCTT 

TGGTTCCAAATAGTTTATGGGGATCCATTTTAGATATACGATCTGCGCAT 

TCAGCAATTCAAGCAAGTGTGGATGGGCGCTCTTATTGTCGAGGATTATG 

GGTTTCTGGAGTTTCGAATTTCTTCTATCATGACCGCGATGCTTTAGGTC 

AGGGATATCGGTATATTAGTGGGGGTTATTCCTTAGGAGCAAACTCCTAC 

TTTGGATCATCGATGTTTGGTCTAGCATTTACCGAAGTATTTGGTAGATC 

TAAAGATTATGTAGTGTGTCGTTCCAATCATCATGCTTGCATAGGATCCG 

TTTATCTATCTACCCAACAAGCTTTATGTGGATCCTATTTGTTCGGAGAT 

GCGTTTATCCGTGCTAGCTACGGGTTTGGGAATCAGCATATGAAAACCTC 

ATATACATTTGCAGAGGAGAGCGATGTTCGTTGGGATAATAACTGTCTGG 

5 ctggagagattggagcgggattaccgattgtgattactccatctaagctc 
tatttgaatgagttgcgtcctttcgtgcaagctgagttttcttatgccga 
tcatgaatcttttacagaggaaggcgatcaagctcgggcattcaagagcg 

GACATCTCCTAAATCTATCAGTTCCTGTTGGAGTGAAGTTTGATCGATGT 

tctagtacacatcctaataaatatagctttatggcggcttatatctgtga 
0 tgcttatcgcaccatctctggtactgagacaacgctcctatcccatcaag 
agacatggacaacagatgcctttcatttagcaagacatggagttgtggtt 
agaggatctatgtatgcttctctaacaagtaatatagaagtatatggcca 
tggaagatatgagtatcgagatgcttctcgaggctatggtttgagtgcag 
gaagtagagtccggttctaaaaatattggttagatagttaagtgttagcg 
5 atgcctttttctttgagatctacatcattttgttttttagcttgtttgtg 
ttcctattcgtatggattcgcgagctctcctcaagtgttaacgcctaatg 
taaccactccttttaagggagacgatgtttacttgaatggagactgcgct 



TTTGTCAATGTCTATGCAGGAGCTGAAGAAGGTTCGATTATCTCAGCTAA 
TGGCGACAATTTAACGATTACCGGACAAAACCATACATTATCATTTACAG 
ATTCTCAAGGGCCAGTTCTTCAAAATTATGCCTTCATTTCAGCAGGAGAG 
ACACTTACTCTGAGAGATTTTTCGAGTCTGATGTTCTCGAAAAATGTTTC 
5 TTGCGGAGAAAAGGGAATGATCTCCGGGAAAACCGTGAGTATTTCCGGAG 
CAGGCGAAGTGATTTTCTGGGATAACTCCGTGGGGTATTCTCCTTTATCT 
ACTGTGCCAACCTCATCATCAACTCCGCCTGCTCCAACAGTTAGTGATGC 
TCGGAAAGGGTCTATTTTTTCTGTAGAGACTAGTTTGGAGATCTCAGGCG 
TCAAAAAAGGGGTCATGTTCGATAATAATGCCGGGAATTTCGGAACAGTT 

10 TTTCGAGGTAAGAATAATAATAATGCTGGTGGTGGAGGCAGTGGGTTCCG 
CTACACCATCAAGTACGACTTTTACAGTTAAAAACTGTAAAGGGAAAGTT 
TCTTTCACAGATAACGTAGCCTCTTGCGGAGGCGGAGTGGTTTATAAAGG 
CATTGTGCTTTTCAAAGACAATGAAGGAGGCATATTCTTCCGAGGGAACA 
CAGCATACGATGATTTAAGGATTCTTGCTGCTACTAATCAGGATCAGAAT 

15 ACGGAGACAGGAGGCGGTGGAGGAGTTATTTGCTCTCCAGATGATTCTGT 
AAAGTTTGAAGGCAATAAAGGTTCTATTGTTTTTGATTACAACTTTGCAA 
AAGGCAGAGGCGGAAGCATCCTAACGAAAGAATTC 



Figure 3 . Amino Acid Sequence of HMW Protein 

MQTSFHKFFLSMILAYSCCSLNGGGYAAEIMVPQGIYDGETLTVSFPYTV 
IGDPSGTTVFSAGELTLKNLDNSIAALPLSCFGNLLGSFTVLGRGHSLTF 
5 ENIRTSTNGAALSNSAADGLFTIEGFKELSFSNCNSLLAVLPAATTNKGS 
QTPTTTSTPSNGTIYSKTDLLLLNNEKFSFYSNLVSGDGGAIDAKSLTVQ 
GISKLCVFQENTAQADGGACQWTSFSAMANEAPIAFVANVAGVRGGGIA 
AVQDGQQGVSSSTSTEDPWSFSRNTAVEFDGNVARVGGGIYSYGNVAFL 
NNGKTLFLNNVASPVYIAAKQPTSGQASNTSNNYGDGGAIFCKNGAQAGS 

10 NNSGSVSFDGEGWFFSSNVAAGKGGAIYAKKLSVANCGPVQFLRNIAND 
GGAIYLGESGELSLSADYGDIIFDGNLKRTAKENAADVNGVTVSSQAISM 
GSGGKITTLRAKAGHQILFNDPIEMANGNNQPAQSSKLLKINDGEGYTGD 
IVFANGSSTLYQNVTIEQGRIVLREKAKLSVNSLSQTGGSLYMEAGSTWD 
FVTPQPPQQPPAANQLITLSNLHLSLSSLLANNAVTNPPTNPPAQDSHPA 

15 VIGSTTAGSVTISGPIFFEDLDDTAYDRYDWLGSNQKINVLKLQLGTKPP 
ANAPSDLTLGNEMPKYGYQGSWKLAWDPNTANNGPYTLKATWTKTGYNPG 
PERVASLVPNSLWGSILDIRSAHSAIQASVDGRSYCRGLWVSGVSNFFYH 
DRDALGQGYRYISGGYSLGANSYFGSSMFGLAFTEVFGRSKDYWCRSNH 
HACIGSVYLSTQQALCGSYLFGDAFIRASYGFGNQHMKTSYTFAEESDVR 

2 0 WDNNCLAGEIGAGLPIVITPSKLYLNELRPFVQAEFS YADHESFTEEGDQ 
ARAFKSGHLLNLSVPVGVKFDRCSSTHPNKYSFMAAYICDAYRTISGTET 
TLLSHQETWTTDAFHLARHGVWRGSMYASLTSNIEVYGHGRYEYRDASR 
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EIMVPQGIYDGETLTVSFPYTVIGDPSGTTVF 



100 * 



SAGELTI_KNLDNSIAALPLSCFGNLLGSFTVLGRGHSLTFENIRTSTNGAALSNSA/|^9GL 

D NS 



FTIEGFKELSFSNCNSLI^VLPMTTNl^GSQTPmSTPSNQIIYSKTDLLLLNNEKFSFY 

P N S 



200 * 



SNLVSGDQGAIDAKSLTVQGISKLCVFQENTAQADGGACQVVTSFSAMANEAPIAFyA 
S 

NVAGVRGGGIAAVQDGQQGVSSSTSTEDPWSFSRNTAVEFCX3NVARVGGGIYSYGNV 



*300 

AFmNGKTLFLNNVASPWIAAKQPTSGQASMSNNYG[X3GAIFCKNGAQ/^SJ^^^ 

E if p [A] 

END [A] 

400* 

VSFDGEGWFFSSNVAAGKGGAIYAKKLSVANCGPVQFLRNIANDGGAIYLGESGELSL 

I 9 
L G 

SADYGD IFDGNLKRTAKENAADVNGVTVSSQAISMGSGGKITTLRAKAGHQILFNDPIE 
M 

500* EcoRI - I 

MANGNNQPAQSSKLiKINIXBEGYTGDIVFANS^TLYQN^ 
EP N 
^1^ N 



• 



SQTGGSLYMEAGST\(VDFVTPQPPQQP PMNC]|lITLSNLHLSI SRI I ANNAv|rM PPTMP 

L S 



600* 



PAQDSHPAVIGSTTAG^VTISGPIFFEDLDDTAYDRYDWLGSNQKINVLKLQLGTKPPA 

P 1= p 6^ 

D QS 

700* 

NAPSDLTLGNEMPKYGYQGSWKL^WDPNTANNGPYTIKATWTKTGYNPGPERVASLV 



PNSLWGSILDIRSAHSAIQASVDGRSYCRGLWVSGVSNFIfYHDRDALGQGYRYISGGYS 

S 

800* 

LGANSYFGSSMFGLAFTEVFGRSKDYWCRSNHHACIGSVYLSTQQALCGSYLFGDAFI 
RASYGFGNQHMKTSYTFAEESDVRWDNNCL^GEIGAGLPIVITPSKLYLNELRPFVQAEF 

I Y Y T 

C V V 

900* 

SYADHESFTEEGDQARAFJjCSGHUJjlL^PVGVKFDRCSSTHPNKYSFMAAYICDAYRTI 

R lOl G 
R M 6 

1000* 

SGTETTU-SHQETWTTDAFHLARHGVWRGSMYASLTSNIEVYGHGRYEYRDASRGYGL 

i t 

Q I f 

1013* 

SAGSpVRF 
K 



